ABSTRACT


This thesis discusses the application of a simulation
method to determine the accuracy of an approximate distribution
of the sample cross-correlation between two linear, stationary
Markov series with known autocorrelations of lag one, $\rho_1$
and $\rho_2$. The topics discussed are: the simulation of the
sample cross-correlation distribution; the generation of
pseudo-random numbers; the statistical testing of simulation
results; the determination of the critical values of the
approximate distribution; and the application of the simulated
distribution to estimate the accuracy of the approximate
distribution and its critical values.


The results presented here show that the approximate
distribution is accurate for low values of the product of
autocorrelations, $\rho_1 \rho_2$ ($|\rho_1 \rho_2| \le$ [value
illegible]). For high values of the autocorrelations and small
sample sizes (< 30) of the Markov series, however, the
approximate distribution appears to have too large a variance.


A practical example is used to illustrate how the
approximate distribution may be applied to test for correlation
between two series of data of known autocorrelations.
CHAPTER I


THE APPROXIMATE NULL DISTRIBUTION OF THE SAMPLE
CROSS-CORRELATION BETWEEN TWO LINEAR STATIONARY MARKOV SERIES.


Testing for correlation between two variables is often
desired, but well-known tests for this purpose are generally
based on the assumption that at least one variable is not
autocorrelated (that is, there is no interdependence or serial
correlation between successive observations of the variable). In
economic, meteorological, biological and some other time series,
autocorrelation usually exists. Hence, there is a real need for
a test for correlation between two autocorrelated series.


To use hypothesis testing, the null distribution of the
sample correlation must be available. For two series of
independent observations, the sample cross-correlation r is
known to have the null distribution of the Pearson correlation
coefficient (see Keeping [21]). However, if both series are
autocorrelated, very little is known of the exact distribution
of r.


Since the mathematical model of a time series is a
stochastic process, the most promising approach to the
satisfactory analysis of time series is the use of stochastic
processes. McGregor and Bielenstein [30] have derived an
approximate null distribution of the cross-correlation between
two autocorrelated series which are generated by stationary,
linear Markov processes. This approximate distribution depends
only on the size of the sample taken in each series and the
product of the autocorrelations in the series.


In order to apply the approximate distribution of r in any
valid test for correlation, it is necessary to determine the
accuracy of this distribution and to compute the critical values
of r. This paper will consider: a) the use of simulation to
check the accuracy of the approximate distribution; b) an
algorithm for evaluating the critical values of r; c) the
estimation of error bounds for the approximate critical values
of r; and d) the application of the approximate distribution
and critical values in some practical examples.


An attempt at obtaining the accuracy of the approximate
distribution is made by comparing this distribution to a
simulated distribution of the cross-correlation coefficient.
Simulation of the cross-correlation distribution requires the
use of a large number of normal random numbers. The simulation
procedure is described in Chapter 2. Chapter 3 will discuss
the properties and choice of a pseudo-random number generator
suitable for use in the simulation. Before the simulated
distribution can be used for any purpose, it is necessary to
determine how closely it represents the true distribution.
Chapter 4 discusses a series of test criteria which can be used
to determine the goodness-of-fit of the simulated distribution.
The comparison of the approximate and simulated distributions,
and the estimation of the error in the approximate critical
values of r are discussed in Chapter 5. Chapter 6 will consist
of an example of application of the approximate
distribution, the conclusion, and discussions of further
research to improve the simulation efficiency.
values observed from two linear, stationary Markov processes
(that is, first order autoregressive processes) defined by:

$$X_t = \rho_1 X_{t-1} + Z_t, \qquad Y_t = \rho_2 Y_{t-1} + Z'_t, \tag{1.1}$$

where $Z_t$ and $Z'_t$ are assumed to be independent, normal,
N(0, 1) random variables; and the autocorrelations of lag one of
the processes, $\rho_1$ and $\rho_2$ respectively, are assumed
to be known.
The cross-correlation (or product correlation) between the
two stationary processes, $X_t$, $Y_t$, is defined by:

$$\rho_{xy} = \frac{\mathrm{Cov}(X_t, Y_t)}{[\mathrm{Var}(X_t)\,\mathrm{Var}(Y_t)]^{1/2}} \tag{1.2}$$

It is a number between -1 and +1, and is zero when $X_t$ and
$Y_t$ are uncorrelated.


An estimator of $\rho_{xy}$ is the sample cross-correlation
coefficient $r_{xy}$, given by:

$$r_{xy} = \frac{\sum_{t=1}^{n} (x_t - \bar{x})(y_t - \bar{y})}{\left[\sum_{t=1}^{n} (x_t - \bar{x})^2 \, \sum_{t=1}^{n} (y_t - \bar{y})^2\right]^{1/2}} \tag{1.3}$$

where

$$\bar{x} = \frac{1}{n} \sum_{t=1}^{n} x_t \quad \text{and} \quad \bar{y} = \frac{1}{n} \sum_{t=1}^{n} y_t .$$

The random variable $r_{xy}$ is an unbiased estimator of
$\rho_{xy}$ if $\rho_{xy} = 0$, since $E(r_{xy}) = \rho_{xy} = 0$.
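The estimator in Eqn. (1.3) can be illustrated with a minimal Python sketch. The function name `sample_cross_correlation` is a hypothetical helper introduced here for illustration and is not part of the thesis; the sketch assumes two series of equal length, neither of which is constant.

```python
import math


def sample_cross_correlation(x, y):
    """Sample cross-correlation r_xy of two equal-length series (Eqn. 1.3)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of cross-products and squares about the sample means.
    s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y))
    s_xx = sum((a - x_bar) ** 2 for a in x)
    s_yy = sum((b - y_bar) ** 2 for b in y)
    # A constant series gives s_xx or s_yy equal to zero and is not handled.
    return s_xy / math.sqrt(s_xx * s_yy)


if __name__ == "__main__":
    print(sample_cross_correlation([1, 2, 3], [1, 3, 2]))
```

Two perfectly linearly related series give a value of +1 or -1, which is a quick sanity check on the sign conventions.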


Processes of the type given in Eqn. (1.1) are frequently
used as sampling models in the studies of economic time series;
and testing for correlation between two series generated by
these processes is often desired. It is often required to test
the null hypothesis that the two processes $X_t$, $Y_t$ are
uncorrelated (that is, $\rho_{xy} = 0$). For this purpose the
distribution of the sample cross-correlation $r_{xy}$ under the
assumption $\rho_{xy} = 0$ must be available, as well as the
corresponding and appropriate critical values for $r_{xy}$.


When either $\{x_t\}$ or $\{y_t\}$, or both, for t = 1, 2, ..., n, are
series of independent observations (that is, either $\rho_1$ or
$\rho_2$ or both are zero so that at least one of the series is a
sample from a normal distribution) the sample cross-correlation
$r_{xy}$ has the following null distribution of the Pearson
correlation coefficient (see Keeping [21]),

$$p(r) = \frac{(1 - r^2)^{(n-4)/2}}{B\!\left(\tfrac{1}{2}, \tfrac{n-2}{2}\right)} \tag{1.4}$$

where r is a value of the random variable $r_{xy}$, p(r) is the
probability density function of $r_{xy}$, n is the number of
observations taken in each Markov series, and B is the Beta
function. In this case the two processes $X_t$, $Y_t$ are
uncorrelated, $\rho_{xy} = 0$. The density function p(r) is a
symmetrical, bell-shaped curve for $n \ge 5$ so that
For an illustration of the graph of p(r) see Fig. 1.1 with
$\rho_1 \rho_2 = 0$. It can be shown that
The kurtosis is -6/(n-1), which tends to zero as n increases.
For very large n, the distribution is approximately normal.
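As a rough numerical check, the Pearson null density can be evaluated and integrated in Python. This is only a sketch: it assumes the standard form of the density, $p(r) = (1 - r^2)^{(n-4)/2} / B(1/2, (n-2)/2)$, and uses a simple midpoint rule rather than the integration scheme of Appendix A. The helper names `beta_fn`, `pearson_null_density` and `midpoint_integral` are hypothetical.

```python
import math


def beta_fn(a, b):
    """Beta function B(a, b), computed through log-gamma for stability."""
    return math.exp(math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b))


def pearson_null_density(r, n):
    """Assumed standard null density of the sample correlation for sample size n."""
    if abs(r) >= 1.0:
        return 0.0
    return (1.0 - r * r) ** ((n - 4) / 2.0) / beta_fn(0.5, (n - 2) / 2.0)


def midpoint_integral(f, a, b, steps=20000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (i + 0.5) * h) for i in range(steps))


if __name__ == "__main__":
    n = 10
    total = midpoint_integral(lambda r: pearson_null_density(r, n), -1.0, 1.0)
    variance = midpoint_integral(
        lambda r: r * r * pearson_null_density(r, n), -1.0, 1.0
    )
    print(total, variance)  # total area and second moment about zero
```

For this density the area should be close to one and, since the mean is zero by symmetry, the second moment is the variance, which for the standard form equals 1/(n-1).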


When both series $\{x_t\}$, $\{y_t\}$ are autocorrelated, the
Pearson distribution no longer holds and very little is known of
the exact distribution of the sample correlation $r_{xy}$.


McGregor and Bielenstein [30] have found an approximate
probability density function of the sample correlation $r_{xy}$
under the hypothesis that the population cross-correlation is
zero, that is, $\rho_{xy} = 0$. For a random sample
$\{(x_t, y_t),\ t = 1, 2, \ldots, n\}$ generated by the two processes in
Eqn. (1.1) the approximate density function, p*(r), of $r_{xy}$
was derived to be
The derived approximate density function, p*(r), depends only
on the sample size n and the product of the autocorrelations,
$\rho_1 \rho_2$. For the special case when the product of the
autocorrelations is zero (that is, $\rho_1 \rho_2 = 0$) the
approximate density function reduces to the null distribution
p(r) given in Eqn. (1.4).
The kth moment, $\mu_k$, of the approximate distribution may be
evaluated by the following general formula:
Computation of the above statistics involves numerical
integration and evaluation of p*(r). The calculation procedure
used is similar to that described in Appendix A for the
tabulated in Table 1.1. Plots of p*(r) for these values of
(n, $\rho_1 \rho_2$) are shown in Figs. 1.1 (a), (b), (c), where the
same scale is used for all three figures. The graphs show that the
Moments, Skewness and Kurtosis of Approximate Distribution
In order to apply the approximate density function of $r_{xy}$
in any valid test for correlation the following points have to
be considered:


a) Since the derived density function of $r_{xy}$ (Eqn. 1.5) is
   only an approximation dependent on the two parameters, the
   sample size n and the product of the autocorrelations,
   $\rho_1 \rho_2$, it is necessary to determine the accuracy of
   the approximate distribution for various combinations of these
   parameters, particularly for small sample sizes (since the
   approximation is of O(1/n) and partly for the reason stated
   in (b) below).


b) It is also necessary to establish a minimum sample size for
   which the approximate density function may be used with
   acceptable accuracy, since in practice the size of series
   of data or items available for analysis is usually small.


c) In order to test the null hypothesis that two
   autocorrelated series are uncorrelated, critical values of
   $r_{xy}$ for appropriate values of $\rho_1 \rho_2$ under the
   assumption $\rho_{xy} = 0$ are required. These values have to
   be computed from the approximate density function for each
   suitable value of $\rho_1 \rho_2$.


These points of interest will form the subject of the following:


1) The design of an algorithm for evaluating the critical
   values of $r_{xy}$ using the approximate density function.
   Discussion of the algorithm, including numerical
   integration, will be contained in Appendix A.


2) The simulation of linear stationary Markov series and the
   distribution of the cross-correlation coefficient $r_{xy}$ in
   Chapter 2. The discussion will cover a method of simulating
   realizations of linear stationary Markov processes,
   including a criterion for determining stationarity of the
   processes, and of obtaining an empirical distribution of
   the sample cross-correlation of these realizations.


3) The generation of pseudo-random numbers, including
   techniques of generating normal random numbers and criteria
   for testing the quality of a random number generator in
   Chapter 3. This chapter will also consider the desired
   properties and the choice of a suitable generator for use
   in the simulation.


4) The determination of the error in simulation by a series of
   goodness-of-fit tests and the estimation of a suitable
   sample size to use for a desired accuracy in the simulation
   in Chapter 4.


5) The use of simulation as a method for determining the
   accuracy of the approximate distribution of $r_{xy}$,
   especially for small sample sizes, in Chapter 5. This
   chapter will study methods of comparing the simulated and
   approximate distributions and critical values.


6) The estimation of error bounds for critical values of
   $r_{xy}$ as computed from the approximate density function in
   Section 5.4.


7) The application of the approximate distribution in a
   practical example in Chapter 6.
CHAPTER II


SIMULATION OF THE CROSS-CORRELATION DISTRIBUTION. 


2.1 Simulation of Linear Stationary Markov Processes. 




In order to study the distribution of cross-correlation
between two linear, stationary Markov series for a variety of
values of sample sizes and autocorrelations, only simulation
provides ready access to a large number of these
cross-correlations.


To simulate realizations of a stationary process of the
form,

$$X_t = \rho X_{t-1} + Z_t \tag{2.1}$$

the following requirements are evident:


a) an initial starting value, $X_0$;


b) a 'good' random number generator which will produce
   independent normal random numbers;


c) a criterion for determining the stage at which the process
   may be considered to have become stationary.
The starting value $X_0$ may be arbitrarily chosen as it
will be shown that the behaviour of the stationary Markov
process is independent of $X_0$.


Chapter 3 will discuss the choice of a 'good' random number
generator and will contain a description of the random generator
that will be used in the simulation.


The criterion for determining stationarity of the process
in Eqn. (2.1) will be discussed in the following section.
where $k_1$ and $k_2$ are integers, depend only on the time lag
$|k_1 - k_2|$.


Consider the first order autoregressive process defined by

$$X_t = \rho X_{t-1} + Z_t \tag{2.2}$$

where $\rho$ is the autocorrelation of lag one of the process
and $\{Z_t\}$ is a sequence of independent random variables of known
constant mean $\mu_z$ and constant variance $\sigma_z^2$. By
successive substitution, Eqn. (2.2) may be rewritten as

$$X_t = \rho^t X_0 + \sum_{j=0}^{t-1} \rho^j Z_{t-j} \tag{2.3}$$

where $X_0$ is the initial value of the process. Taking
expectation, we have
since $\{Z_t\}$ are independent random variables of constant mean
and variance. The covariance of $X_t$, $X_s$ may be
expressed as follows:
using the result obtained in Eqn. (2.5).


For the process in Eqn. (2.2) to be stationary, the
mean, $E(X_t)$, and variance, $\mathrm{Var}(X_t)$, must be independent of t,
and the covariance, $\mathrm{Cov}(X_t, X_s)$, must depend only on the lag
Hence, the process (2.2) is stationary only after a sufficient
number of terms has been generated. The time point at which the
process may be considered to have stabilized can be determined
by setting $\rho^t$ equal to a very small value (that is, a
numerical zero). In the simulation the following condition for
stationarity is used:

$$|\rho|^{t_0} = 10^{-8} . \tag{2.7}$$

Hence, the time point $t_0$ after which the process $X_t$
stabilizes is given by

$$t_0 = \ln 10^{-8} / \ln |\rho| . \tag{2.8}$$

At time point $t_0$ we may write

$$E(X_{t_0}) = \frac{\mu_z}{1 - \rho} + \varepsilon_1 ,$$

where $\varepsilon_1$, $\varepsilon_2$, $\varepsilon_3$ are small
error terms (of the order of $10^{-8}$). Since the errors are small
we may assume that at this time point we have a stationary series.
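The stationarity criterion of Eqns. (2.7) and (2.8) can be sketched in Python as follows. The names `burn_in_length` and `stationary_ar1_sample` are hypothetical, rounding $t_0$ up to the next integer is an assumption made here (the text gives $t_0$ as a real number), and Python's built-in Mersenne Twister stands in for the generator chosen in Chapter 3.

```python
import math
import random


def burn_in_length(rho, tol=1e-8):
    """Smallest integer t0 with |rho|**t0 <= tol, following Eqn. (2.8)."""
    if rho == 0:
        return 0  # the series is already a sequence of independent values
    if abs(rho) >= 1:
        raise ValueError("a stationary process requires |rho| < 1")
    return math.ceil(math.log(tol) / math.log(abs(rho)))


def stationary_ar1_sample(rho, n, x0=0.0, rng=None):
    """Generate n terms of X_t = rho * X_(t-1) + Z_t after discarding t0 terms."""
    rng = rng or random.Random()
    x = x0
    for _ in range(burn_in_length(rho)):
        x = rho * x + rng.gauss(0.0, 1.0)  # transient terms, thrown away
    sample = []
    for _ in range(n):
        x = rho * x + rng.gauss(0.0, 1.0)
        sample.append(x)
    return sample


if __name__ == "__main__":
    print(burn_in_length(0.5), burn_in_length(0.9))
    print(stationary_ar1_sample(0.5, 5, rng=random.Random(1)))
```

The burn-in grows quickly as $|\rho|$ approaches one, which is why strongly autocorrelated series are the expensive cases to simulate.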


As can be seen from Eqn. (2.5) and (2.6), the variance and
covariance are independent of the initial value $X_0$, and by
setting $\rho^t$ equal to a numerical zero as a condition for
stationarity the mean becomes independent of $X_0$. Hence, in the
stationary state the process $X_t$ does not depend on the
starting value $X_0$.


For the case where $\{Z_t\}$ is a sequence of independent
N(0,1) random variables we have as t tends to infinity,
To obtain empirical distributions of the cross-correlations
between autocorrelated series, a set of observations of two
series is generated by means of processes of the form in
Eqn. (2.2) and the sample cross-correlation between these series
is computed using the formula given below. The following
notation is used:


$X_t$, $Y_t$ - linear, stationary Markov processes defined by

$$X_t = \rho_1 X_{t-1} + Z_t, \qquad Y_t = \rho_2 Y_{t-1} + Z'_t \tag{2.9}$$

$\rho_1$, $\rho_2$ - known autocorrelations of lag one

$Z_t$, $Z'_t$ - independent normal N(0,1) random variables

$t_x$, $t_y$ - time-points at which $X_t$, $Y_t$ respectively,
      reach stationarity as determined by Eqn. (2.8)

n   - the number of realizations of each of the
      processes $X_t$, $Y_t$; that is, the sample size
      or length of the time series

$r_{xy}$ - the sample cross-correlation between the X
      and Y processes

r   - a value of the random variable $r_{xy}$

N   - the number of simulated observations r of $r_{xy}$.


The following procedure is used to simulate N values of
the sample cross-correlation between two series generated by
$X_t$, $Y_t$:


1) The first $t_x$ successive values of $\{x_t\}$ are generated and
   discarded.


2) The next n generated values are then taken to be the sample
   observations of the $\{x_t\}$ series.


3) The above process is repeated to obtain a sample of n
   observations of the $\{y_t\}$ series.


4) The value of the sample cross-correlation $r_{xy}$ for this
   particular ($\{x_t\}$, $\{y_t\}$) sample is then evaluated by the
   following formula:


5) Steps (2), (3) and (4) are repeated N times to obtain N
   simulated values of the sample cross-correlation
   coefficient $r_{xy}$ between the $\{x_t\}$ and $\{y_t\}$ series.
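The five steps above can be sketched in Python as follows. This is one possible reading, not the thesis's program: each series is burned in once and successive samples are taken from the continuing series (step 5 repeats only steps 2 to 4), the sample correlation is the usual product-moment formula, and Python's `random.Random` replaces the generator of Chapter 3. All function names are hypothetical.

```python
import math
import random


def burn_in_length(rho, tol=1e-8):
    """Number of initial terms to discard, from |rho|**t0 = tol."""
    if rho == 0:
        return 0
    return math.ceil(math.log(tol) / math.log(abs(rho)))


def ar1_stream(rho, rng, x0=0.0):
    """Yield terms of X_t = rho * X_(t-1) + Z_t after the burn-in (step 1)."""
    x = x0
    for _ in range(burn_in_length(rho)):
        x = rho * x + rng.gauss(0.0, 1.0)
    while True:
        x = rho * x + rng.gauss(0.0, 1.0)
        yield x


def cross_correlation(x, y):
    """Product-moment correlation of two equal-length samples (step 4)."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xy = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y))
    s_xx = sum((a - x_bar) ** 2 for a in x)
    s_yy = sum((b - y_bar) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def simulate_cross_correlations(rho1, rho2, n, N, seed=0):
    """Return N simulated values of r_xy for independent AR(1) series."""
    rng = random.Random(seed)
    xs = ar1_stream(rho1, rng)
    ys = ar1_stream(rho2, rng)
    values = []
    for _ in range(N):  # step 5: repeat steps 2 to 4
        x = [next(xs) for _ in range(n)]  # step 2
        y = [next(ys) for _ in range(n)]  # step 3
        values.append(cross_correlation(x, y))
    return values


if __name__ == "__main__":
    r = simulate_cross_correlations(0.5, -0.5, n=10, N=1000, seed=1)
    print(min(r), max(r), sum(r) / len(r))
```

Restarting each series with a fresh burn-in for every sample would be an equally defensible reading and would make successive samples fully independent at the cost of extra random numbers.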


The sample of N observations of $r_{xy}$ generated by this
procedure is then organized and summarized to obtain the
following:


1) A frequency table as shown in Tables 4.2 and 4.3.


2) A graph of the relative frequency distribution,
   representing the empirical probability density function of
   $r_{xy}$, as shown in Figs. 4.2 and 4.3.


3) A graph of the cumulative frequency distribution,
   representing the empirical cumulative distribution function
   of $r_{xy}$, as shown also in Figs. 4.2 and 4.3.


4) The kth moment, $m_k$, about the mean of the empirical
   distribution as given by the general formula:

$$m_k = \left[ \sum_{i=1}^{N} (r_i - m_1)^k \right] / N$$

   where $m_1$ here denotes the mean of the N values $r_i$.


5) Skewness and kurtosis as given by:

$$\text{Skewness} = m_3 / (m_2)^{3/2}$$

$$\text{Kurtosis} = m_4 / (m_2)^2$$
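The summary statistics in items 4) and 5) can be computed with a short Python sketch. It assumes the moment ratios $m_3 / m_2^{3/2}$ and $m_4 / m_2^2$; the optional `excess` flag, which subtracts 3 so that a normal distribution scores zero, is an addition made here for convenience and is not taken from the text. The function names are hypothetical.

```python
def central_moment(values, k):
    """kth moment about the mean of a list of observations."""
    count = len(values)
    mean = sum(values) / count
    return sum((v - mean) ** k for v in values) / count


def skewness(values):
    """Third central moment scaled by the 3/2 power of the variance."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5


def kurtosis(values, excess=False):
    """Fourth central moment scaled by the squared variance."""
    ratio = central_moment(values, 4) / central_moment(values, 2) ** 2
    return ratio - 3.0 if excess else ratio


if __name__ == "__main__":
    data = [1, 2, 3, 4, 5]
    print(central_moment(data, 2), skewness(data), kurtosis(data))
```

Applied to the N simulated values of $r_{xy}$, these functions give the empirical counterparts of the statistics tabulated for the approximate distribution.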

The simulated distributions of cross-correlation for
$\rho_1 \rho_2 = 0$, n = 10, 30 and N = 7000 are shown in Figs. 4.2
and 4.3, and tabulated in Tables 4.2 and 4.3. Table 4.4 shows
the moments, variance, skewness and kurtosis computed for these
distributions.


Digressing from the consideration of two processes,
consider for a moment the behaviour of an autoregressive process
of lag one by itself. Fig. 2.1 shows a series of 100 terms
generated according to Eqn. (2.9) with $\rho_1 = .5$ and the
corresponding observed autocorrelations $r_{xx}(k)$ of lags
k = 1, 2, ..., 20. Fig. 2.2 demonstrates these values for
$\rho_2 = -.5$. The autocorrelation values of $r_{xx}(k)$ and
$r_{yy}(k)$ are given in Table 2.1. The sample autocorrelation of
lag k for the X process is given by:
The theoretical autocorrelation of lag k is given by

$$\rho(k) = \rho^{|k|} \tag{2.11}$$

For the series with $\rho_1 = .5$ the theoretical
autocorrelation function (acf) is given by
$\rho_{xx}(k) = .5^{|k|}$, which decays to zero exponentially with
increasing lag, as can be observed from the plots of the sample
acf in Fig. 2.1(b). For the series with $\rho_2 = -.5$, the
theoretical acf is $\rho_{yy}(k) = (-.5)^{|k|}$, which also damps
out exponentially but oscillates from positive to negative
values, reflecting the oscillatory nature of the series. This
behaviour is also indicated by the sample acf in Fig. 2.2. From
these two examples it can be seen that a process with only one
nonzero autoregressive coefficient, that of lag 1, has an
exponentially decaying autocorrelation function. Hence, in
practice it is often difficult to determine whether or not the
time series comes from an autoregressive model of order one or
from some other model. Box and Jenkins [5] have developed
guidelines for both identification and estimation of time series
models.
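The comparison between the sample and theoretical autocorrelation functions can be sketched in Python. The estimator in `sample_acf` is the common one that divides the lag-k sum of cross-products by the total sum of squares; this is an assumption, since the sample autocorrelation formula itself is not reproduced in the text above. Both function names are hypothetical.

```python
def sample_acf(x, max_lag):
    """Sample autocorrelations of lags 1..max_lag (assumed standard estimator)."""
    n = len(x)
    mean = sum(x) / n
    c0 = sum((v - mean) ** 2 for v in x)
    return [
        sum((x[t] - mean) * (x[t + k] - mean) for t in range(n - k)) / c0
        for k in range(1, max_lag + 1)
    ]


def theoretical_acf(rho, max_lag):
    """Theoretical AR(1) autocorrelations rho**k for lags 1..max_lag."""
    return [rho ** k for k in range(1, max_lag + 1)]


if __name__ == "__main__":
    print(theoretical_acf(0.5, 5))   # monotone exponential decay
    print(theoretical_acf(-0.5, 5))  # decay with alternating sign
    print(sample_acf([1, 2, 3, 4, 5], 2))
```

With only 100 terms the sample values scatter noticeably around $\rho^k$, which is the practical difficulty in model identification mentioned above.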
Table 2.1


First 20 Lags of the Sample ACF for Sample of 100 Terms of the
$X_t$ and $Y_t$ Processes.
CHAPTER III


PSEUDO-RANDOM NUMBER GENERATION.


All Monte Carlo methods and most simulation studies depend
on the use of random numbers, usually in great quantities.
Hence, the need arises for fast methods of generating large
numbers of random numbers with a fairly wide variety of
distribution functions.


Most techniques for generating random numbers from specific
distributions depend on random numbers uniformly distributed
over the interval (0,1). The usual procedure is to generate
values of uniform random variables from the interval (0,1) and
by some functions transform these values to random numbers from
the desired distribution. For example, if y is a uniform
random number from the interval (0,1) (that is, a U(0,1) random
number) then

$$x = -\frac{1}{\lambda} \ln y$$

is a random number having the exponential distribution with
parameter $\lambda$. If $y_1$, $y_2$ are U(0,1) random numbers then


$$
\begin{aligned}
x_1 &= (-2 \ln y_1)^{1/2} \cos 2\pi y_2 \\
x_2 &= (-2 \ln y_1)^{1/2} \sin 2\pi y_2
\end{aligned} \tag{3.1}
$$

are a pair of independent standard normal (that is, N(0,1))
random numbers. This last set of transformations is known as the
Box-Muller method [6].
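Both transformations can be sketched in a few lines of Python. The exponential transform assumes that $\lambda$ is a rate parameter, so that $x = -(1/\lambda) \ln y$; that parametrization is an assumption. The function names are hypothetical, and the uniform inputs must lie in (0, 1] so that the logarithm is defined.

```python
import math
import random


def exponential_from_uniform(y, lam):
    """Inverse-transform sample: x = -(1/lam) * ln(y), assuming lam is a rate."""
    return -math.log(y) / lam


def box_muller(y1, y2):
    """Map two U(0,1) numbers to a pair of independent N(0,1) numbers."""
    radius = math.sqrt(-2.0 * math.log(y1))
    angle = 2.0 * math.pi * y2
    return radius * math.cos(angle), radius * math.sin(angle)


if __name__ == "__main__":
    rng = random.Random(1)
    # 1 - random() lies in (0, 1], which avoids taking log(0).
    u1, u2 = 1.0 - rng.random(), 1.0 - rng.random()
    print(exponential_from_uniform(u1, lam=2.0))
    print(box_muller(u1, u2))
```

Each call to `box_muller` consumes two uniform numbers and returns two normal numbers, so no uniform input is wasted.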


In complex sampling experiments it is useful to be able to
select random numbers and repeat the calculations as a method of
checking the results, and increasing the accuracy of the
experiment. The need for the generation of a random number
sequence that can be regenerated is self-contradictory because
of the definition of random numbers. It is, however, possible to
obtain numbers by a deterministic method which display
random behaviour. In practice, a random number is usually
obtained by a computer program by means of an algorithm which
will generate a sequence of numbers satisfying various
statistical criteria of randomness. Such a sequence is called
pseudo-random.


Two types of methods of generating random numbers for
sampling with computers have been proposed, the physical process
and the arithmetical process.


In the physical process, the output of some physical device
which is attached to the computer, such as random noise or pulse
generated is converted to a sequence of random digits. This
method, however, involves practical difficulties such as the
storage of a large volume of random numbers for check
calculations and hence, is usually not very applicable for fast
access of random numbers in present digital machines.


In the arithmetical process, a sequence of pseudo-random
numbers is derived using an algorithm and an initial supply of
random numbers as starting values. The sequence of generated
numbers is always cyclic (since the number representation in a
digital computer has a finite number of digits); however, if the
cycle is long enough this will present no difficulties. The
generated numbers are deterministic and completely predictable
as soon as the initial values and computational rules are known.
Four types of arithmetical processes for generating
pseudo-random numbers have been used in various studies:


a) the mid-square method,
b) the randomization by summation modulo p method,
c) the sequence of digits in transcendental numbers,
d) the residue-class or multiplicative congruential method.


In this thesis only the multiplicative congruential method will
be considered (see Section 3.4). Descriptions and discussions on
the other three methods may be found in Jannson [17] and Tocher
[38].




To determine the performance of a random number generator,
a broad range of statistical properties are investigated. Some
desirable properties which a 'good' generator should satisfy and
which can be used for comparison between different generating
processes are:


a) Good statistical behaviour - the generated numbers (in
   large and small samples) must satisfy various statistical
   criteria of randomness and the distribution function of
   these numbers must approximate as closely as possible to
   the desired distribution.


b) Long period - the period after which the sequence of
   numbers repeats itself should be sufficiently long to ensure
   that the sequence contains enough random numbers for it to
   be useful in a particular problem.


c) Rapid and short calculating procedure - the generating time
   on the computer must be short and the space used for
   storage in core memory small.


Points (b) and (c) are often easy to study and for most
generators can be achieved by proper choice of initial values
and parameters of the generator. The choice of initial values
and parameters, however, will depend on the computer used.


Point (a) is more difficult to study and requires thorough
investigation in many different respects. The usual procedure is
to test a sample of generated numbers by standard statistical
tests concerning distribution and randomness. However, the
suitability of a generator for any study depends upon the
properties required for that particular use. Hence, a necessary
condition for the approval of a sequence of pseudo-random
numbers is that the numbers pass such statistical tests as are
relevant for the application under consideration.


3.3 Statistical Tests for Pseudo-Random Number Generators.


Ideally these statistical tests should be selected in
accordance with the actual applications, since different
applications are more or less sensitive to different properties
of the random numbers. In practice, however, most of the
generators have a general use as standard routines and they
consequently have to pass a number of standard tests. The tests
most often used will be briefly introduced here. Detailed
descriptions of these tests and their applications may be found
a) Tests for the Distribution of Generated Numbers.


1) Moments.

   The mean, variance, third and fourth moments, skewness
   and kurtosis are computed for samples of the generated
   numbers and compared with the corresponding values from
   the true distribution.


2) Goodness-of-Fit of Generated Numbers to a Theoretical
   Distribution.

   A goodness-of-fit test is performed on a sample of the
   generated numbers to determine how closely the generated
   numbers fit the desired distribution. The two most
   commonly used goodness-of-fit tests are:

   i) Chi-square test, and

   ii) Kolmogorov-Smirnov test.


3) Cumulative Distribution.

   The sample cumulative distribution is computed for the
   generated numbers and compared with the values from the
   desired theoretical cumulative distribution.


4) Order and Small Sample Statistics.

   The rth order statistic of a sample is the rth
   smallest observation in a sample. A useful test for the
   local statistical properties of a generator is to
   compute the distribution of the order statistics, range,
   mean and variance for small samples of size, say,
   4, 6, 10, 16. A lack of randomness would cause these
   observed distributions to deviate from the theoretical
   distributions.
b) Tests for the Randomness of the Generator.


1) Serial Correlation.

   A serious type of non-randomness which might be expected
   from the generator is correlation between successive
   numbers and between numbers that are k elements apart.
   Measures of these correlations are the serial
   correlations of lag 1 and lag k. If the numbers are
   random there should be no correlation between them. As a
   test for randomness the sample correlations are computed
   and compared with the theoretical values.
2) Run Tests.

   Serial effects could also be revealed by run tests. A
   run up (or down) of length p is defined as a subsequence

$$x_i < x_{i+1} < \dots < x_{i+p-1} < x_{i+p}$$

   (and with reversed inequality signs for runs down) in a
   sequence of N random numbers.

   Let r = number of runs in the sequence,
   $r_p$ = number of runs of length p in the sequence.

   Then expressions for expected values are given by Levene
   and Wolfowitz [39],

$$E(r) = (2N - 1)/3 ,$$

$$\mathrm{Var}(r) = (16N - 29)/90 ,$$

$$E(r_p) = \frac{2\,[\,(p^2 + 3p + 1)N - (p^3 + 3p^2 - p - 4)\,]}{(p+3)!} .$$

   Let r(m) be the number of runs above and below the
   mean, then

$$E(r(m)) = (N/2) + 1 .$$

   These runs are counted by constructing a sequence of N
   symbols, each of which has the sign plus or minus depending on
   whether $x_i$ is greater than 1/2 or less than 1/2.
   The runs of plus and minus signs are then counted. For
   testing the generated sequence, the observed moments are
   compared with the above formulas.


3) Extreme Values.

   In a sequence of random numbers the extreme values are
   expected to be randomly dispersed. The expected number
   of such values is 1 per 10,000 (see Chen [7]) and they
   should follow the Poisson distribution with mean 1. In
   this test the observed cumulative distribution of
   extreme values is compared with the expected value
   computed from the Poisson distribution. The
   Kolmogorov-Smirnov test may be used to compare the two
   distributions.
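Three of the tests listed above can be sketched in Python for a sequence of U(0,1) numbers: the chi-square statistic for uniformity, the serial correlation of lag k, and the count of runs above and below 1/2. The function names, the choice of ten equal-width cells, and the treatment of a value exactly equal to 1/2 as "below" are all assumptions made for illustration; critical values would still have to be looked up separately.

```python
import random


def chi_square_uniform(u, bins=10):
    """Chi-square statistic comparing cell counts with a uniform expectation."""
    counts = [0] * bins
    for v in u:
        counts[min(int(v * bins), bins - 1)] += 1
    expected = len(u) / bins
    return sum((c - expected) ** 2 / expected for c in counts)


def serial_correlation(u, k=1):
    """Sample correlation between numbers that are k elements apart."""
    n = len(u)
    mean = sum(u) / n
    c0 = sum((v - mean) ** 2 for v in u)
    return sum((u[i] - mean) * (u[i + k] - mean) for i in range(n - k)) / c0


def runs_about_half(u):
    """Number of runs of plus and minus signs relative to 1/2."""
    signs = [v > 0.5 for v in u]
    return 1 + sum(1 for a, b in zip(signs, signs[1:]) if a != b)


if __name__ == "__main__":
    rng = random.Random(1)
    sample = [rng.random() for _ in range(1000)]
    print(chi_square_uniform(sample))
    print(serial_correlation(sample, k=1))
    print(runs_about_half(sample), len(sample) / 2 + 1)  # observed, expected
```

For a sequence that behaves randomly, the chi-square statistic should be comparable to its degrees of freedom (bins minus one), the serial correlation should be near zero, and the number of runs should be near N/2 + 1.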
3.4 Multiplicative Congruential Pseudo-Random Number Generator.


The multiplicative congruential generator, which is based on
number theory, takes the form

$$X_{n+1} = A X_n \pmod{m}, \qquad n = 0, 1, 2, \ldots, \tag{3.2}$$

where A is the multiplier, m is the modulus, and $X_0$ is the
starting value or seed; A, $X_0$ and m are selected integers such
that $0 < A < m$ and $0 < X_0 < m$. The sequence $\{X_n / m\}$ is
then taken to be the uniform random number sequence in (0,1).


The procedure used in this generator is briefly outlined as
follows:

1) A beginning value $x_0$ (also known as the seed) is chosen,
the first number of the pseudo-random sequence;

2) $x_0$ is multiplied by the constant multiplier A;

3) The product is divided by the modulus m and the remainder
taken as a new x, the next pseudo-random number;

4) Steps 2 and 3 are repeated, with the new random number in
place of $x_0$, for every successive number desired.
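
Steps 1) to 4) can be sketched in Python as follows. This is a minimal sketch, not the routine used in the thesis: the multiplier 16807 and the modulus 2^31 - 1 are assumed values chosen only for illustration, and the function names are hypothetical.

```python
def mcg_states(seed, count, multiplier=16807, modulus=2**31 - 1):
    """Return `count` successive integers x_1, x_2, ... of x_{n+1} = A x_n (mod m).

    The default multiplier and modulus are illustrative assumptions.
    """
    if not 0 < seed < modulus:
        raise ValueError("seed must satisfy 0 < seed < modulus")
    states = []
    x = seed
    for _ in range(count):
        x = (multiplier * x) % modulus  # steps 2 and 3: multiply, keep the remainder
        states.append(x)
    return states


def mcg_uniforms(seed, count, multiplier=16807, modulus=2**31 - 1):
    """Scale the integer sequence by the modulus to obtain numbers in (0, 1)."""
    return [x / modulus for x in mcg_states(seed, count, multiplier, modulus)]


if __name__ == "__main__":
    print(mcg_states(1, 3))
    print(mcg_uniforms(1, 3))
```

Python integers do not overflow, so the product is formed exactly; on a fixed word-length machine the same step has to be arranged so that the product fits the available registers.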
The multiplicative generator is one of the most frequently
used generators and appears to be a satisfactory generator for
the following reasons:

a) It is economical in computing time and memory requirements.
Only two numbers, A and x, have to be stored.
Computation of x can be accomplished by one
multiplication since, if m is chosen judiciously, taking
moduli merely means shifting the radix point of the product.

b) For proper choice of m, A and $x_0$ the sequence of
random numbers obtained from Eqn. (3.2) will never
degenerate (that is, produce any $x_n$ equal to zero, which
would make all subsequent $x_{n+i}$ zero), and a maximum cycle
length may be obtained.

c) This generator has been well-tested in several studies
[16], [19], [26] and has been found to be highly
satisfactory, the word 'satisfactory' referring in large
part to the existence of good statistical properties for
the sequence of numbers generated.


The process of searching for a good multiplicative
generator is focused on determining A and m so that the
generator behaves well according to the evaluation principles of
Section 3.2. The following briefly summarizes the conditions
which these parameters should satisfy:

1) It can be shown from number theory (Fermat-Euler theorem)
that in order to generate a full length cycle (that is, a
cycle for which $A^n$ returns to the starting point for each
repetition):

gcd(A, m) = 1 and gcd($x_0$, m) = 1 .

2) From number theory it can be shown that if m is a prime
number and A a primitive root of m then Eqn. (3.2)
generates a full cycle which is a permutation of the
integers 1, 2, ..., m-1.

3) The operation of taking a congruence involves a division
and it is desirable to choose m to make this usually
lengthy operation rapid. For a binary computer the usual
choices of m are:

$m = 2^\beta$ or $m = 2^\beta \pm 1$ ,

where $\beta$ is the number of bits of a computer word. In
these cases, division by m can be replaced by a shift or a
shift and a subtraction or addition, respectively.

4) Choice of $m = 2^\beta$ makes the calculations fast and simple
but it makes it impossible to attain the largest possible
periods. The maximum cycle attainable is of length $2^{\beta-2}$
instead of $2^\beta$. The values of A and $x_0$ giving this cycle
must satisfy the conditions

$A \equiv \pm 3 \pmod 8$ ,
$x_0$ odd.
5) For $m = 2^\beta - 1$, the longest cycle (that is, period
m-1) will be given when $\beta$ is chosen so that $2^\beta - 1$
is a prime and A is a primitive root of m; then the
cycle length is $2^\beta - 2$. Note that $2^\beta - 1$ is prime
only if $\beta$ is prime.


6) A should be large so as to prevent a small value $x_n$
being followed by small values $A x_n, \ldots$ .
A large value of A causes $A x_n$ to exceed m and so $x_{n+1}$
is not necessarily small.
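
The cycle-length statements in conditions 4) and 5) can be checked by brute force for small moduli. The sketch below is an illustration under stated assumptions: the function name and the small test values (modulus 2^10 and modulus 31) are hypothetical and are not parameters from Table 3.1.

```python
from math import gcd


def cycle_length(multiplier, modulus, seed):
    """Number of steps of x -> A x (mod m) needed to return to the seed.

    Requires gcd(A, m) = gcd(seed, m) = 1, in which case the map is a
    permutation of the residues coprime to m and the loop must terminate.
    """
    if gcd(multiplier, modulus) != 1 or gcd(seed, modulus) != 1:
        raise ValueError("multiplier and seed must both be coprime to the modulus")
    start = seed % modulus
    x = (multiplier * start) % modulus
    steps = 1
    while x != start:
        x = (multiplier * x) % modulus
        steps += 1
    return steps


if __name__ == "__main__":
    beta = 10
    # A = 5 and A = 3 are congruent to -3 and +3 (mod 8): cycle length 2^(beta-2).
    print(cycle_length(5, 2**beta, 1), cycle_length(3, 2**beta, 7))
    # A = 9 is congruent to 1 (mod 8): the cycle is shorter.
    print(cycle_length(9, 2**beta, 1))
    # m = 2^5 - 1 = 31 is prime and 3 is a primitive root of 31: cycle length m - 1.
    print(cycle_length(3, 31, 1))
```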


Some of the multiplicative generators that have been 
investigated by various authors and tested for use with 
different types of binary computers are listed below in Table 
3.1. In most cases the author has either made reference to or 
based the choice of the parameters for the generator on the 
recommendations of Coveyou and MacPherson [8], who have given 
one of the few analytical evaluations of pseudo-random number 


generators in the literature. 
Table 3.1
Some Pseudo-Random Number Generators for Binary Computers.


Author (s) Computer 


S.Gorenstein 


D.Y.Downham odd 
F.D.K.Roberts integer 


G.Marsaglia IBM 360 FORTRAN random 
Dehe Duay: IBM 7094 | FORTRAN + integer 
SRU 1108 | FORTRAN 


D.W.Hutchinson| IBM 7094 


P.A.W.Lewis IBM random
A.S.Goodman 360/67 integer
J.M.Miller


D.S.Seraphin 


Oak Ridge
Laboratory


E.H.Chen FORTRAN odd 
integer 


N.A. stands for "not available".


* generating time per number. 
In the simulation of the cross-correlation distribution we
are required to use standard normal N(0,1) random numbers.
Hence, we shall now consider some techniques of generating
normal random numbers which are based on the multiplicative
congruential method.
The random normal number generator which was proposed by
E.H.Chen [7] is based on the combination of two multiplicative
congruential generators. This dual type of generator has also
been suggested by Kronmal [23] and by Maclaren and Marsaglia [28].
The multiplicative congruential generator used by Chen is
of the form

$R_{i+1} = K R_i \pmod{2^p}$ ,   i = 0, 1, 2, ...   (3.3)

where p is a positive integer, $0 < R_i < 2^p$,
K is an odd integer,
$R_0$, the starting number, is a random odd integer.

Uniform random variables from (0,1) are then obtained by

$U_i = R_i / 2^p$ .   (3.4)
Two independent random standard normal variables are produced
from two independent uniform variables, $U_1$ and $U_2$, by the
Box and Muller transformation:

$X_1 = (-2 \ln U_1)^{1/2} \cos(2 \pi U_2)$
                                              (3.5)
$X_2 = (-2 \ln U_1)^{1/2} \sin(2 \pi U_2)$

Normal variables with other means and variances may be obtained
by suitable linear transformations of the X's.
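
A minimal Python sketch of the Box and Muller transformation of Eqn. (3.5) is given below for illustration. The function names are hypothetical, and Python's built-in generator is used as a stand-in source of uniform numbers; it is not the congruential generator of Eqn. (3.3).

```python
import math
import random


def box_muller(u1, u2):
    """Map two independent uniforms on (0, 1) to two independent N(0, 1) values."""
    radius = math.sqrt(-2.0 * math.log(u1))
    angle = 2.0 * math.pi * u2
    return radius * math.cos(angle), radius * math.sin(angle)


def normal_sample(count, rng):
    """Generate `count` standard normal numbers from the uniform source `rng`."""
    out = []
    while len(out) < count:
        u1 = 1.0 - rng.random()  # lies in (0, 1], so log(u1) is always defined
        u2 = rng.random()
        out.extend(box_muller(u1, u2))
    return out[:count]


if __name__ == "__main__":
    xs = normal_sample(40000, random.Random(2024))
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)
    print("mean:", round(mean, 4), "variance:", round(var, 4))
```

A normal variable with mean mu and standard deviation sigma is then obtained as mu + sigma * x.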


Good statistical behaviour of the generator in Eqn. (3.3)
is dependent on the careful choice of the values of p and K.
The performance of various combinations of p and K with
respect to serial correlation (lag 1), mean and variance for
the normal variables was investigated by Chen. In order to
improve serial correlation a dual type of generator of the
following form was tried and tested by Chen:


bo 
=N 
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R Tei 1 (235084 23) (mod 231) 
(386) 


RR py s(2 2804 %3)o] (222) +3) (mod 231) . 
This generator was found to be satisfactory relative to several
criteria for testing normality and randomness. The periods of
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the parst and second generators are 229 sand «(26> ), 
respectively. The combined period of the dual generator is at 


least 231, 


A brief account of the series of statistical tests
performed by Chen on his random number generator and the results
obtained is given in the following section. To test the
generator, ten trials each of sample size $n = 10^6$ were
successively generated with the first pair of starting integers
randomly chosen. For each trial the series of tests listed below
was performed. As a further check on the performance of the
generator we repeated some of these tests on the generator using
sample sizes of 40,000, generated with the same pairs of
starting integers as those used by Chen. The results obtained
were compared with Chen's.


A) Tests for Normality 


1) Mean and Variance. 


The mean and variance were computed for each trial. 
Assuming the sample mean to be normally distributed with 
mean zero and variance 1/n, and the sample variance to 
have a chi-square distribution with n-1 degrees of 
freedom, Chen found that none of the computed sample means 


and variances exceeded the .05 level of significance. 
We computed the sample mean and variance for 10 trials of
sample size 40,000 each and found that the values
obtained compared quite well with those obtained by Chen
using a sample size of $10^6$. The two sets of test results
are shown in Table 3.2. Fig. 3.1 illustrates the fit of one
of the samples of 40,000 normal random numbers to the
standard normal curve.
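
The mean and variance check can be sketched as below, for illustration only. The function names are hypothetical. The mean is referred to the N(0, 1/n) distribution as in the text; for the variance, the chi-square distribution with n-1 degrees of freedom is replaced here by its large-sample normal approximation (mean n-1, variance 2(n-1)), which is an assumption of this sketch and not the procedure reported by Chen.

```python
import math
from statistics import NormalDist, fmean, variance


def mean_variance_z_scores(sample):
    """Standardized deviations of the sample mean and variance from 0 and 1."""
    n = len(sample)
    m = fmean(sample)
    s2 = variance(sample, m)  # unbiased sample variance
    z_mean = m * math.sqrt(n)  # the sample mean is N(0, 1/n) under the hypothesis
    # (n-1) s2 is chi-square with n-1 d.f.; normal approximation for large n.
    z_var = ((n - 1) * s2 - (n - 1)) / math.sqrt(2.0 * (n - 1))
    return z_mean, z_var


def passes_at_level(z, alpha=0.05):
    """Two-sided test of a standardized deviation at significance level alpha."""
    return abs(z) <= NormalDist().inv_cdf(1.0 - alpha / 2.0)


if __name__ == "__main__":
    import random
    rng = random.Random(1)
    xs = [rng.gauss(0.0, 1.0) for _ in range(40000)]
    zm, zv = mean_variance_z_scores(xs)
    print(round(zm, 3), passes_at_level(zm), round(zv, 3), passes_at_level(zv))
```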


Table 3.2

Mean and Variance for 10 Trials of Sample Size 40,000
and $10^6$ each.
2) Goodness-of-Fit.

The chi-square test was used to test the goodness-of-fit of
the generated numbers to the standard normal. The test
statistics showed close agreement with the theoretical
value and the null hypothesis of normality was accepted at
the .05 significance level.

3) Central Area under Normal Curve.

The frequencies of generated deviates between -x and x
were compared with the values np, where p is the area
under the standard normal curve between -x and x for
x = 0 (.01) 4.50. There were no large deviations of the
observed values from the theoretical. The Kolmogorov
statistic was also computed for each trial and was found to
lie within the Kolmogorov .10 bound.

4) Order and Small Sample Statistics.

Each trial was divided into $10^6/N$ subsamples for N =
4, 6, 10, 16. For each subsample the order statistics, range,
mean and variance were computed. The mean and variance of
these statistics taken over the $10^6/N$ subsamples showed
excellent agreement with the theoretical values.
B) Tests for Randomness

1) Serial Correlation.

Serial correlations of lag 1, 2 and 3 were computed. None
of the serial correlations were significantly different
from zero at the .05 level.

Using sample sizes of 40,000 we computed the serial
correlations of lags 1 and 2 and compared the results with
Chen's. The two sets of results were found to
agree well, as can be seen from Table 3.3.

2) Run Tests.

Runs up and down, and runs above and below the mean of
various lengths were obtained for each trial. None of the
chi-square test statistics computed for these runs were
significant at the .05 level.
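
For illustration, the lag-k serial correlation used in test 1) above can be computed as sketched below. The estimator shown (lagged cross-products about the overall mean, divided by the total sum of squares) is one common definition and is an assumption here; the text does not state which form was used. The function name is hypothetical.

```python
import math
import random


def serial_correlation(xs, lag=1):
    """Sample serial correlation of the sequence xs at the given lag."""
    n = len(xs)
    mean = sum(xs) / n
    denom = sum((x - mean) ** 2 for x in xs)
    numer = sum((xs[i] - mean) * (xs[i + lag] - mean) for i in range(n - lag))
    return numer / denom


if __name__ == "__main__":
    rng = random.Random(7)
    xs = [rng.gauss(0.0, 1.0) for _ in range(40000)]
    for lag in (1, 2, 3):
        r = serial_correlation(xs, lag)
        # For an independent sequence r is approximately N(0, 1/n) when n is large.
        print(lag, round(r, 5), abs(r) <= 1.96 / math.sqrt(len(xs)))
```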
Table 3.3

Serial Correlations of Lags 1 and 2 for 10 Trials of Sample
Size 40,000 and $10^6$ each.


Serial Correlation 


OOON AML WH = 


wach 


Wincan) |e 00075 ~.00017 ~.0007 ~.00005 


Expected 
Value 


3) Extreme Values. 


For each trial, 100 subsamples of size 10,000 were 
examined for values exceeding in absolute value 35054 
(which are classified as extreme values). The observed 
cumulative distribution of these extreme values agreed 


excellently with the theoretical distribution. 
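
The extreme-value check can be sketched as follows, for illustration only. The sketch assumes that the cut-off is the point whose two-sided normal tail probability is 1 in 10,000, so that a subsample of 10,000 numbers is expected to contain one extreme value; the function names are hypothetical.

```python
import math
from statistics import NormalDist


def extreme_threshold(rate=1e-4):
    """|x| cut-off whose two-sided standard normal tail probability equals `rate`."""
    return NormalDist().inv_cdf(1.0 - rate / 2.0)


def count_extremes(values, threshold):
    """Number of values exceeding the threshold in absolute value."""
    return sum(1 for v in values if abs(v) > threshold)


def poisson_cdf(k, mean=1.0):
    """P(X <= k) for a Poisson variable with the given mean."""
    return sum(math.exp(-mean) * mean**j / math.factorial(j) for j in range(k + 1))


def max_cdf_gap(counts, mean=1.0):
    """Largest gap between the observed and the Poisson cumulative distributions."""
    n = len(counts)
    gap = 0.0
    for k in range(max(counts) + 1):
        observed = sum(1 for c in counts if c <= k) / n
        gap = max(gap, abs(observed - poisson_cdf(k, mean)))
    return gap


if __name__ == "__main__":
    import random
    rng = random.Random(3)
    cut = extreme_threshold()
    counts = [count_extremes([rng.gauss(0.0, 1.0) for _ in range(10000)], cut)
              for _ in range(100)]
    print("threshold:", round(cut, 4), "max cdf gap:", round(max_cdf_gap(counts), 4))
```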
In a second set of tests, all previously discussed
statistics except small sample statistics were computed for each
of 100 subsamples of size 10,000. The overall agreement of
these test statistics with the theoretical values was quite
good. Judging from the results of the statistical tests, the
performance of this random number generator may be considered to
be remarkably good. We shall next consider the choice of a
suitable random number generator for our simulation experiment.


The choice of a random number generator for any
investigation is based on the properties required for that
particular use. In the simulation of the cross-correlation
distribution we are interested in simulating realizations of two
linear, stationary Markov processes (Eqn. (2.9)) which are
uncorrelated. Since correlation between the two processes, $X_t$
and $Y_t$, could arise only through the terms $Z_{1t}$ and $Z_{2t}$, it
is essential that the generator used in the simulation should
generate uncorrelated normal random numbers. Furthermore, the
generated numbers must satisfy in all respects various
statistical criteria of normality and randomness. In the process
of searching for a suitable generator for the simulation, three
recently proposed generators were closely examined: those
suggested by the Oak Ridge Laboratory,
P.A.W.Lewis (1969) [26] and E.H.Chen (1971) [7]. Some of the
important features of these generators are summarized in
Table 3.1. Chen's generator has been described in the preceding
section. All three generators use the multiplicative
congruential method to generate uniform U(0,1) random numbers.
Standard normal N(0,1) random numbers are then obtained by the
Box-Muller transformation (Eqn. (3.5)).


Of the three generators considered, only the ones proposed
by Lewis and Chen have been well-tested for their performance
as uniform and normal random number generators, respectively,
and have been found to perform remarkably well. Complete
descriptions and results of the testing of these generators are
available in [7] and [26]. As for the generator suggested by
the Oak Ridge Laboratory, there has been no report on its
performance or of any testing done on it.


A comparison was made between the three generators using
samples of 40,000 standard normal random numbers generated by
each of the generators (and the Box-Muller transformation, where
necessary). For each sample, the mean, variance, third and
fourth moments, skewness and kurtosis were computed, as shown in
Table 3.4. Reasonably good results were obtained in each case.
For each generator, samples of 1000 generated normal numbers
were tested for 'normality' by the Kolmogorov-Smirnov criterion
(see Section 4.2 for the Kolmogorov-Smirnov goodness-of-fit
test). In each case there was good agreement between the
empirical and theoretical distributions.


Table 3.4 


Comparison of Moments, Skewness, Kurtosis of Samples of 40,000 
Normal Random Numbers Generated by Various Generators. 


| Generator | Mean | Var. | 3rd Mom. | 4th Mom. | Skewness | Kurtosis |


Oak.Lab. 


Lewis 


Chen 


Expected 
Value 3 


The above comparison and the information in Table 3.1 did
not provide any strong indication as to which of the three
generators is the best. It is, however, necessary to decide on
the one considered most suitable for the simulation. Since the
generator proposed by the Oak Ridge Laboratory has not been
thoroughly tested and proved satisfactory for application, it
was decided to disregard it for the simulation. Between the
generators proposed by Lewis and Chen, it was decided after
careful consideration that Chen's generator would perhaps be a
more suitable choice for the simulation of the cross-correlation
distribution, since

1) the generator generates standard normal random numbers
which have been thoroughly tested for normality and
randomness using large and small samples, and has been
found to be highly satisfactory;

2) it has been specially designed to minimize serial
correlation between generated numbers, which is one
important property the generator for the simulation must
have;

3) the period of the generator is at least $2^{31}$.


Note that this does not mean that Lewis's generator is in
any way inferior to Chen's. Using Chen's generator we
can avoid having to test the generator thoroughly to ensure it
satisfies all properties of normality and randomness, and hence
save a considerable amount of computer time. Furthermore, the
statistical quality of Chen's generator, as indicated by the
test results, is found to be acceptable and sufficient for our
simulation purposes. This random number generator is implemented
as a Fortran subroutine in the main simulation program. Each
call of the subroutine returns a sequence of normal random
numbers of specified length.
CHAPTER IV


DISCUSSION OF STATISTICAL TESTS OF SIMULATED DISTRIBUTIONS.


4.1 Introduction to Relevant Statistical Tests of Simulated
Distributions.


Before using the simulated distribution for any purpose it
is necessary to determine the accuracy of this distribution. The
error in a simulation experiment may arise from sampling
fluctuations, such as variations in sample distributions which
are influenced by the sample size, or from nonsampling errors
such as nonrandomness of the sample drawn and incorrect
population distribution. Hence, the following questions should
be considered in the present simulation:

a) How closely does the simulated distribution represent the
actual distribution of cross-correlation?

b) How much sampling must be done to reduce the error due to
sampling fluctuations to a desired value?

The first question concerns the goodness-of-fit of the simulated
distribution while the second question is concerned with the
determination of the sample size required to obtain the
simulated distribution to a desired precision.


Since only the distribution of the sample cross-correlation
for the case $\rho_1 \rho_2 = 0$ is known (it is the null distribution of
Pearson's correlation coefficient), it is used as the
hypothesized distribution for testing the goodness-of-fit of the
simulated distribution. The distribution of the sample
cross-correlation with either $\rho_1$ or $\rho_2$ equal to zero is simulated
and compared with this known distribution. The following tests,
which are described in detail in the next section, are used:

1) Goodness-of-fit tests.

a) Kolmogorov-Smirnov test of goodness-of-fit [22, 29].

This test compares the empirical cumulative distribution
function (cdf) directly with the hypothesized cdf. The
measure of discrepancy used is the maximum absolute
deviation between these two cdf's. This test statistic can
be used to compute the necessary sample size for a desired
precision in the simulated distribution, and to form a
confidence band which can be used to estimate the accuracy
of the approximate distribution given by Eqn. (1.5). This
will be explained in the next chapter.




b) Anderson-Darling test of goodness-of-fit [1,25].

Since we are interested in computing the critical points
for the sample cross-correlation, it is important that the
tails of the simulated distribution agree well with those
of the hypothesized distribution. The Anderson-Darling test,
which is also based on comparing the empirical cdf with the
hypothesized cdf, is designed to be especially sensitive to
discrepancies at the tails of the distributions and,
hence, is used to test for good agreement of the
distributions in the tail regions. Details are given in
Section 4.3.

2) Comparison of simulated and theoretical critical points.

Another method of checking the accuracy of the simulated
distribution is to compare the critical points of this
distribution with those of the theoretical distribution.
This method is based on Bahadur's [2] theory on sample
quantiles and is described in Section 4.4. The theory also
furnishes a method for computing error bounds for the
critical points of the approximate distribution.
Let X be a random variable with the continuous
probability distribution function

$U(x) = \Pr\{X \le x\}$ .   (4.1)

Let $X_1, X_2, \ldots, X_N$ be a sample of size N for X
ordered so that $X_1 \le X_2 \le \cdots \le X_N$. The empirical
distribution function of the sample $X_1, X_2, \ldots, X_N$ is
the step-function $S_N(x)$ defined by

$S_N(x) = 0$     for $x < X_1$ ,
$S_N(x) = k/N$   for $X_k \le x < X_{k+1}$ ,   (4.2)
$S_N(x) = 1$     for $x \ge X_N$ .

That is, $N S_N(x)$ equals the number of variables $X_i$ which
are less than or equal to x. For large N one would expect

$S_N(x) \to U(x)$ as $N \to \infty$ .

The goodness-of-fit problem is to devise a test of the
hypothesis

$H_0 : U(x) = F(x)$ for all x ,

where F(x) is a completely specified, hypothesized
distribution function of the random variable X.

For the purpose of testing the hypothesis $H_0$, Kolmogorov
[22] introduced in 1933 the statistic

$D(N) = \max_{-\infty < x < \infty} |S_N(x) - F(x)|$ ,   (4.3)

which measures the maximum absolute deviation between the sample
cumulative distribution $S_N(x)$ and the hypothesized cumulative
distribution F(x). $H_0$ is rejected if D(N) is sufficiently
large.
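
As an illustration, the statistic D(N) of Eqn. (4.3) can be computed from an ordered sample as sketched below. Because $S_N(x)$ jumps at each observation, the deviation is examined just before and at each jump. The function name and the uniform example are hypothetical.

```python
def ks_statistic(sample, cdf):
    """Maximum absolute deviation between the empirical cdf and `cdf`."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for k, x in enumerate(xs, start=1):
        f = cdf(x)
        # S_N equals (k-1)/N just below x and k/N at x.
        d = max(d, k / n - f, f - (k - 1) / n)
    return d


if __name__ == "__main__":
    import random
    rng = random.Random(11)
    sample = [rng.random() for _ in range(1000)]
    # Hypothesized distribution: uniform on (0, 1), so F(x) = x.
    print(round(ks_statistic(sample, lambda x: x), 4))
```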


The probability distribution of the random variable D(N)
depends on N but is independent of the special form of F(x)
provided only that F(x) is continuous (that is, the test is
distribution free). The exact distribution of D(N) is not
known but Kolmogorov found that $N^{1/2} D(N)$ has a limiting
distribution given by
The function L(z) has been tabulated by Smirnov [37]. The
early work of Kolmogorov and Smirnov is summarized in [22] and
[36]. Tables of the critical values of D(N) have been given by
Massey [29], Birnbaum [4] and Miller [31]. Appendix Table D1
gives the critical values of D(N) computed by Massey.

This test of goodness-of-fit was proposed by Anderson and
Darling [1]. The test is sensitive to discrepancies at
the tails of the distribution rather than near the median. The
test procedure is the following:

Let $X_1 \le X_2 \le \cdots \le X_N$ be N ordered observations in a
sample from the random variable X, and let

$u_i = F(X_i)$ ,   (4.7)

where F(x) is a completely specified, hypothesized distribution
function of X, and let $S_N(x)$ be the empirical distribution
function as defined in (4.2). The test criterion suggested by
Anderson and Darling is

$W(N) = N \int_{-\infty}^{\infty} [S_N(x) - F(x)]^2 \, \psi(F(x)) \, dF(x)$ ,   (4.8)
where $\psi(F(x)) = \psi(u)$ is some non-negative weight function
chosen to accentuate the values of $S_N(x) - F(x)$ where the test
is desired to have sensitivity. The hypothesis that the sample
has been drawn from the distribution F(x) is rejected if W(N)
is sufficiently large.

For the test to be sensitive to discrepancies at the tails
of the distribution, $\psi(u)$ should be large for u near 0 and 1,
and small near u = 1/2. The weight function chosen by Anderson
and Darling is

$\psi(u) = 1/[u(1 - u)]$ .   (4.9)

This function has the effect of weighting the tails heavily
since it is large near u = 0 and u = 1.

Substituting (4.9) for $\psi(F(x))$, Eqn. (4.8) may be written
as

$W(N) = N \int_{-\infty}^{\infty} \frac{[S_N(x) - F(x)]^2}{F(x)[1 - F(x)]} \, dF(x)$ .   (4.10)
By straightforward integration and collection of terms, (4.10)
reduces to

$W(N) = -N - \frac{1}{N} \sum_{j=1}^{N} (2j - 1)[\ln u_j + \ln(1 - u_{N-j+1})]$ .   (4.11)

If this number, W(N), is too large, the hypothesis that F(x) is
the true distribution is rejected.
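
The computing form of Eqn. (4.11) can be sketched in Python as follows, for illustration only. The function name is hypothetical; the $u_j$ are obtained by applying the hypothesized cdf to the sample and sorting, and every $u_j$ is assumed to lie strictly between 0 and 1 so that the logarithms exist.

```python
import math


def anderson_darling(sample, cdf):
    """W(N) = -N - (1/N) * sum_{j=1..N} (2j - 1) [ln u_j + ln(1 - u_{N-j+1})]."""
    u = sorted(cdf(x) for x in sample)
    n = len(u)
    total = 0.0
    for j in range(1, n + 1):
        # u[j - 1] is u_j and u[n - j] is u_{N-j+1} in the 1-based notation of the text.
        total += (2 * j - 1) * (math.log(u[j - 1]) + math.log(1.0 - u[n - j]))
    return -n - total / n


if __name__ == "__main__":
    import random
    rng = random.Random(5)
    sample = [1.0 - rng.random() for _ in range(999)]  # values in (0, 1]
    sample = [x for x in sample if x < 1.0]
    # Hypothesized distribution: uniform on (0, 1), so F(x) = x.
    print(round(anderson_darling(sample, lambda x: x), 4))
```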


Asymptotic Significance (or critical) points for W(N) are 
given by Anderson and Darling (see Table 4.5). Significance 
points for W(N) for small sample sizes have been determined and 
tabulated by Lewis [25] who also gave the following equivalent 


form of the test statistic W(N) : 
Since the theoretical and approximate density functions of
the sample cross-correlation are symmetrical about r = 0 in the
null case, only the upper (or positive) critical points need to
be considered. The critical point of $r_{xy}$ at the $\alpha$-significance
level is also the $(1-\alpha)$-quantile of the cdf of $r_{xy}$.
Bahadur's theory [2] on sample quantiles, which is used in the
comparison of critical points, may be outlined as follows:

Let F(x) be the theoretical probability distribution
function. The p-quantile, $\xi_p$, of F(x) is defined by

$F(\xi_p) = p$ ,   0 < p < 1 .   (4.13)
Let

($X_1, X_2, \ldots, X_N$) be a random sample from F(x),

$Y_{N,p}$ be the sample p-quantile,

$Z_N$ be the number of observations $X_i$ in the sample such
that $X_i > \xi_p$,

f(x) be the density function corresponding to F(x),

q = 1 - p .

Then,

$Y_{N,p} = \xi_p + [Z_N - Nq]/[N f(\xi_p)] + R_N$   (4.14)

where $R_N$ becomes negligible as $N \to \infty$. It was shown by Bahadur
that with probability one,

$R_N = O(N^{-3/4} \log N)$ as $N \to \infty$ .

Hence, for N large $R_N$ may be assumed to be zero and the
term $[Z_N - Nq]/[N f(\xi_p)]$ gives an estimate of the possible
difference between the sample and the theoretical p-quantile.


As a means of checking the accuracy of the simulated
distribution, the p-quantile (or q-critical value) can be
computed for the simulated and theoretical distributions for
$\rho_1 \rho_2 = 0$ and the observed difference between the two values
can be compared to the corresponding value of the above-stated
error estimate which is computed for the two distributions. This
test on the simulated distribution will be performed in Section
4.5 Error Bounds for Critical Points of Simulated and
Approximate Distributions.

The theory on sample quantiles [2] may also be utilized to
estimate error bounds for critical points of the simulated
distribution and the approximate distribution of Eqn. (1.5). As
can be seen from Eqn. (4.14), the error between the theoretical
and simulated quantile will not be the same for all quantiles,
and, hence, has to be determined individually for each quantile
as shown below.


It is known that $Y_{N,p}$ is approximately normally
distributed when N is large with

$E(Y_{N,p}) = \xi_p$ ,   (4.15)

$\mathrm{Var}(Y_{N,p}) = pq / [N f^2(\xi_p)]$   (4.16)

and $N^{1/2}(Y_{N,p} - \xi_p)$ is asymptotically normally distributed
when $N \to \infty$ with

$E(N^{1/2}(Y_{N,p} - \xi_p)) = 0$ ,   (4.17)

$\mathrm{Var}(N^{1/2}(Y_{N,p} - \xi_p)) = pq / f^2(\xi_p)$ .   (4.18)
Let

$U_{N,p} = Y_{N,p} - \xi_p$ .   (4.19)

Since $N^{1/2} U_{N,p}$ is asymptotically normally distributed, we can
determine, for various large sample sizes N, certain critical
points of the distribution of $U_{N,p}$ as shown below.

Let $Z_{N,p}$ denote the standardized normal variable of
$N^{1/2} U_{N,p}$. Then

$Z_{N,p} = [N^{1/2} U_{N,p}] / [\mathrm{Var}(N^{1/2} U_{N,p})]^{1/2}$
        $= [N^{1/2} U_{N,p} f(\xi_p)] / (pq)^{1/2}$ .   (4.20)

Let $z_\alpha$ be the upper $\alpha$-critical point of the N(0,1)
distribution. Then the upper $\alpha$-critical value of
$N^{1/2} U_{N,p}$ is given by

$z_\alpha (pq)^{1/2} / f(\xi_p)$ .

It follows that the critical values of the asymptotic
distribution of $U_{N,p}$ are given by

$U_{N,p,\alpha} = z_\alpha (pq)^{1/2} / [N^{1/2} f(\xi_p)]$ .   (4.22)


$U_{N,p,\alpha}$ can be used to determine an approximation to the error
between the simulated and theoretical critical points of the
p(r) distribution. The difference between the simulated and
theoretical critical values for a simulation sample size N is
compared to the value of $U_{N,p,\alpha}$. If this difference is less
than $U_{N,p,\alpha}$ for the specified $\alpha$, then we are $(1-\alpha)100\%$
certain that the error between the simulated and theoretical
critical points is at most $U_{N,p,\alpha}$. Hence $U_{N,p,\alpha}$ may be taken
to be the resultant error in simulation for sample size N.


From Eqn. (4.22) we can compute the sample size which is
necessary for a desired precision in the simulated critical
points of p(r). For example, to be 99% sure that the error
in the simulated .01-critical point is less than .02, we
have,

Therefore N = 5178.4 .

Hence, for any sample size N ≥ 5200 used in the simulation the
error in simulation for the critical points is at most .02.
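
Eqn. (4.22) and its inversion for the sample size can be sketched as below, for illustration only. The function names are hypothetical, and the density value used in the example (0.2 at the .99 quantile) is an assumed number, not a value taken from the thesis.

```python
import math
from statistics import NormalDist


def quantile_error_bound(n, p, density_at_quantile, alpha):
    """U_{N,p,alpha} = z_alpha (pq)^(1/2) / [N^(1/2) f(xi_p)], with q = 1 - p."""
    z = NormalDist().inv_cdf(1.0 - alpha)  # upper alpha-critical point of N(0,1)
    return z * math.sqrt(p * (1.0 - p)) / (math.sqrt(n) * density_at_quantile)


def required_sample_size(error, p, density_at_quantile, alpha):
    """Sample size N for which U_{N,p,alpha} equals the desired error."""
    z = NormalDist().inv_cdf(1.0 - alpha)
    return z * z * p * (1.0 - p) / (density_at_quantile * error) ** 2


if __name__ == "__main__":
    p, f, alpha = 0.99, 0.2, 0.01  # f = 0.2 is an assumed density value
    n = required_sample_size(0.02, p, f, alpha)
    print("N =", round(n, 1), "bound at N:", round(quantile_error_bound(n, p, f, alpha), 4))
```

Since the bound is proportional to $N^{-1/2}$, halving the error requires four times as many simulated values.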


4.6 Kolmogorov-Smirnov Test on Simulated Distribution. 
To determine the accuracy of the simulation, the
distribution of the cross-correlation for the case $\rho_1 \rho_2 = 0$ is
generated and compared to the known distribution given by
Eqn. (1.4) by means of the Kolmogorov-Smirnov test. The following
notation is used:

$p_A(r;n,k)$ = approximate distribution p(r) with $\rho_1 \rho_2 = k$
$p_S(r;n,k)$ = simulated distribution p(r)
$p_T(r;n,k)$ = theoretical or true distribution p(r)
$F_A(r;n,k)$ = cdf of $p_A(r;n,k)$
$F_S(r;n,k)$ = cdf of $p_S(r;n,k)$
$F_T(r;n,k)$ = cdf of $p_T(r;n,k)$
n = sample size of Markov series
N = number of simulated r values
$D_{ST}(N)$ = Max $|F_T(r;n,k) - F_S(r;n,k)|$
$D_\alpha(N)$ = $\alpha$-critical value of $D_{ST}(N)$, such that

$P\{D_{ST}(N) > D_\alpha(N)\} = \alpha$


Using the table of critical values, $D_\alpha(N)$, by Massey
given in Appendix D (see Table D1), we can estimate the
sample size necessary for a desired accuracy in the simulated
distribution. For a large sample size, N, we can say we are 99%
sure that the simulated distribution $F_S(r;n,0)$ will lie within
$1.63 N^{-1/2}$ of the true distribution $F_T(r;n,0)$ over the
entire distribution, if the observed maximum deviation $D_{ST}(N)$
is less than the .01-critical value $D_{.01}(N)$. Therefore to
make this statement for a deviation of .02 the necessary
sample size is found as follows:

From Appendix Table D1 we find that

$D_{.01}(N) = 1.63 N^{-1/2} = .02$ ,

therefore N = 6643 .

Hence, for a sample of 7000 the error in the simulated
distribution should be within 2% over the entire distribution
with 99% probability.
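
The sample-size arithmetic above can be reproduced with the short sketch below. It assumes only the large-sample form $1.63 N^{-1/2}$ of the .01-critical value quoted in the text; the function names are hypothetical.

```python
import math

KS_COEFFICIENT_01 = 1.63  # large-sample .01-critical value is about 1.63 / sqrt(N)


def ks_band_half_width(n, coefficient=KS_COEFFICIENT_01):
    """Largest deviation tolerated at the .01 level for N simulated values."""
    return coefficient / math.sqrt(n)


def ks_required_sample_size(max_deviation, coefficient=KS_COEFFICIENT_01):
    """Smallest whole N for which coefficient / sqrt(N) <= max_deviation."""
    return math.ceil((coefficient / max_deviation) ** 2)


if __name__ == "__main__":
    print(ks_required_sample_size(0.02))        # sample size for a deviation of .02
    print(round(ks_band_half_width(7000), 4))   # half-width actually reached at N = 7000
    print(ks_required_sample_size(0.01))        # sample size for a deviation of .01
```

Halving the permitted deviation from .02 to .01 multiplies the required sample size by four, which is why a 1% bound needs more than 25,000 simulated values.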
Using sample sizes ranging from 50 to 7000, p(r) with
$\rho_1 \rho_2 = 0$ was simulated for n = 10, 30 and the Kolmogorov-Smirnov
statistic $D_{ST}(N)$ was computed for each of the
simulated distributions. The observed values of $D_{ST}(N)$ and
corresponding critical values $D_\alpha(N)$
are tabulated in Table 4.1. Fig. 4.1 shows values of
$D_{ST}(N)$ plotted against N, for both values of n.

Table 4.1

Kolmogorov-Smirnov Statistics for Simulated Distribution
As shown in Table 4.1, the observed Kolmogorov-Smirnov
statistic, $D_{ST}(N)$, for each of the distributions simulated is
less than the corresponding .01-critical value. Using the
Kolmogorov-Smirnov criterion, the simulated distribution,
$F_S(r;n,0)$, for a sample size of 7000 is within .0108 of the
theoretical distribution, $F_T(r;n=10,0)$, and is within .0094
of $F_T(r;n=30,0)$ with 99% probability.


Hence, using sample sizes of 7000 we can estimate with
99% certainty that the error in the simulated distribution will
be at most .02 over the entire distribution for any value of
$\rho_1 \rho_2$. (Note that determination of the sample size for this
degree of accuracy in the simulated distribution is independent
of the value of n.) Having estimated the error in the
simulated distribution, we shall now apply this upper bound on
the error estimate in the confidence band technique furnished by
the Kolmogorov-Smirnov test to evaluate the error in the
approximate distribution, p*(r), for values of $\rho_1 \rho_2 \ne 0$ as
will be shown in Section 5.1.


Note that the accuracy in the simulated distribution can be 
improved by using a larger simulation sample size, N. To obtain 
a maximum of 1% error over the entire simulated distribution 
would require a sample size of over 25,000. However, this 
approach is expensive in terms of computer time. Hence, we have 
limited the value of N used in this simulation to 7000. It is 


also obvious that it is impossible to obtain very significant 
improvement in the accuracy by this direct sampling method. 
Therefore it is important and profitable to explore (perhaps in 
further research) alternative methods, such as variance- 
reduction techniques, for reducing the sampling variability of 
simulations and improving the accuracy without increasing the 
number of simulations. 


Tables 4.2 and 4.3 show the frequency distributions of the 
simulated r values for the case ρ_1 ρ_2 = 0 for n = 10 and n = 30. 
Figs. 4.2 and 4.3 illustrate the fit of the simulated values to 
the theoretical values both for the density function and the 
cumulative distribution function. Table 4.4 compares the 
moments, variance, skewness and kurtosis of the simulated 
distribution to the theoretical distribution. As can be seen 
from the table, the two sets of values agree quite well. 


Note that in order to compare the generated (or simulated) 
and theoretical p(r) values, the values of the generated p(r) 
and cdf in columns 4 and 5 of the frequency tables (4.2 and 
4.3) are given by 


Generated p(r) = Frequency / [ (N) (Interval Width) ] 


Generated cdf = Cumulative Sum of Frequency / N 
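
As a minimal sketch of these two formulas, the Python fragment below converts a list of class frequencies into the generated density and cdf columns. The function name is an assumption; the four frequencies used in the example are the first four entries of Table 4.2 (N = 7000, interval width .05).

```python
from itertools import accumulate


def generated_columns(frequencies, n_simulations, interval_width):
    """Return (generated density, generated cdf) for binned simulation counts."""
    density = [f / (n_simulations * interval_width) for f in frequencies]
    cdf = [c / n_simulations for c in accumulate(frequencies)]
    return density, cdf


# First four class frequencies of Table 4.2.
density, cdf = generated_columns([2, 1, 2, 13], 7000, 0.05)
print([round(d, 4) for d in density])
print([round(c, 4) for c in cdf])
```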
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Table 4.2 


Simulated and Theoretical Distributions of Cross-Correlation 


N = 7000     n = 10     ρ_1 ρ_2 = 0.0 

    Range        Freq.  Gen.P(r)  Gen.CDF  Theo.P(r)  Theo.CDF 
-1.00  -0.95        2    0.0057   0.0003    0.0001    0.0000 
-0.95  -0.90        1    0.0029   0.0004    0.0033    0.0001 
-0.90  -0.85        2    0.0057   0.0007    0.0141    0.0005 
-0.85  -0.80       13    0.0371   0.0026    0.0356    0.0017 
-0.80  -0.75       17    0.0486   0.0050    0.0657    0.0042 
-0.75  -0.70       43    0.1229   0.0111    0.1168    0.0088 
-0.70  -0.65       56    0.1600   0.0191    0.1765    0.0161 
-0.65  -0.60       97    0.2771   0.0330    0.2475    0.0267 
-0.60  -0.55      117    0.3343   0.0497    0.3280    0.0410 
-0.55  -0.50      127    0.3629   0.0679    0.4157    0.0596 
-0.50  -0.45      173    0.4943   0.0926    0.5079    [illegible] 
-0.45  -0.40      191    0.5457   0.1199    0.6017    0.1104 
-0.40  -0.35      249    0.7114   0.1554    0.6942    0.1428 
-0.35  -0.30      302    0.8629   0.1986    0.7825    0.1798 
-0.30  -0.25      296    0.8457   0.2409    0.8639    0.2210 
-0.25  -0.20      324    0.9257   0.2871    0.9359    0.2660 
-0.20  -0.15      359    1.0260   0.3384    0.9963    0.3143 
-0.15  -0.10      367    1.0490   0.3909    1.0430    0.3654 
-0.10  -0.05      364    1.0400   0.4429    1.0750    0.4184 
-0.05   0.00      387    1.1060   0.4981    1.0920    [illegible] 
 0.00   0.05      352    1.0060   0.5484    1.0920    [illegible] 
 0.05   0.10      352    1.0060   0.5987    1.0750    0.5816 
 0.10   0.15      373    1.0660   0.6520    1.0430    0.6346 
 0.15   0.20      341    0.9743   0.7007    0.9963    0.6857 
 0.20   0.25      351    1.0030   0.7509    0.9359    0.7340 
 0.25   0.30      330    0.9429   0.7980    0.8639    [illegible] 
 0.30   0.35      261    0.7457   0.8353    0.7825    [illegible] 
 0.35   0.40      252    0.7200   0.8713    0.6942    [illegible] 
 0.40   0.45      204    0.5829   0.9004    0.6017    0.8896 
 0.45   0.50      164    0.4686   0.9239    0.5079    [illegible] 
 0.50   0.55      170    0.4857   0.9481    0.4157    0.9404 
 0.55   0.60      147    0.4200   0.9691    0.3280    0.9590 
 0.60   0.65       80    0.2286   0.9806    0.2475    [illegible] 
 0.65   0.70       61    0.1743   0.9893    0.1765    [illegible] 
 0.70   0.75       35    0.1000   0.9943    0.1168    [illegible] 
 0.75   0.80       25    0.0714   0.9979    0.0657    0.9958 
 0.80   0.85       12    0.0343   0.9996    0.0356    0.9984 
 0.85   0.90        1    0.0029   0.9997    0.0141    [illegible] 
 0.90   0.95        2    0.0057   1.0000    0.0033    [illegible] 
 0.95   1.00        0    0.0000   1.0000    0.0001    1.0000 
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Table 4.3 


Simulated and Theoretical Distributions of Cross-Correlation 


N = 7000     n = 30     ρ_1 ρ_2 = 0.0 

    Range        Freq.  Gen.P(r)  Gen.CDF  Theo.P(r)  Theo.CDF 
-1.00  -0.95        0    0.0000   0.0000    0.0000    0.0000 
-0.95  -0.90        0    0.0000   0.0000    0.0000    0.0000 
-0.90  -0.85        0    0.0000   0.0000    0.0000    0.0000 
-0.85  -0.80        0    0.0000   0.0000    0.0000    0.0000 
-0.80  -0.75        0    0.0000   0.0000    0.0000    0.0000 
-0.75  -0.70        0    0.0000   0.0000    0.0001    0.0000 
-0.70  -0.65        0    0.0000   0.0000    0.0008    0.0000 
-0.65  -0.60        2    0.0057   0.0003    0.0033    0.0001 
-0.60  -0.55        6    0.0171   0.0011    0.0113    0.0004 
-0.55  -0.50       12    0.0343   0.0029    [illegible]  0.0015 
-0.50  -0.45       22    0.0629   0.0060    0.0753    0.0040 
-0.45  -0.40       52    0.1486   0.0134    0.1570    0.0096 
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Table 4.4 
Moments, Skewness and Kurtosis of Simulated and Theoretical 
Distribution. 


ρ_1 ρ_2 = 0     N = 7000 


Mean             0.0042    0.0000    0.0018 
Variance         0.1108    0.0345    0.0350 
Third Moment     -.0050    0.0000    0.0001 
Fourth Moment    0.0297    0.0033    0.0035 
Skewness         -.0136    0.0000    0.0202 
Kurtosis         [illegible]  2.8070  [illegible] 


One of our main objectives is to determine the critical 
values of the distribution of cross-correlation. We are 
therefore interested in the tails of our simulated distribution 
- those areas at the ends of the range containing 1% to 5% of 
the total area under the curve. To obtain good estimates of the 
critical values, the simulated distribution should have a good 
fit at the tail regions. Since the Kolmogorov-Smirnov test is 
not particularly sensitive to discrepancies at the tails of the 
distribution, the Anderson-Darling test was used to check the 
goodness-of-fit at these regions of the simulated distribution. 


The following notation is used : 
N      = number of simulations of r values 
r_i    = i-th observation (in ascending order) in a sample of 
         N simulations 
u_i    = F_T(r_i; n, k)   (theoretical cdf) 
W(N)   = Anderson-Darling statistic given by 

    W(N) = -N - (1/N) Σ_{i=1}^{N} (2i-1) [ ln u_i + ln(1 - u_{N-i+1}) ] 

W_α(N) = α-critical value of W(N) 
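
The statistic W(N) defined above is straightforward to compute once the theoretical cdf values u_i are available. The following Python sketch assumes the u_i have already been obtained from the theoretical cdf (which is not reproduced here); the function name is illustrative.

```python
import math


def anderson_darling(u_values):
    """Anderson-Darling statistic W(N) from theoretical cdf values u_i.

    u_values are the cdf values of the observations; they are sorted here
    so that u[0] <= ... <= u[N-1], as the formula requires.
    """
    u = sorted(u_values)
    n = len(u)
    total = 0.0
    for i in range(1, n + 1):
        # u[i - 1] is u_i and u[n - i] is u_{N-i+1} in 1-based notation.
        total += (2 * i - 1) * (math.log(u[i - 1]) + math.log(1.0 - u[n - i]))
    return -n - total / n


# A single observation at the median gives W = 2 ln 2 - 1.
w_single = anderson_darling([0.5])
print(w_single)
```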


The above test was performed on the simulated distributions 
with ρ_1 ρ_2 = 0 for n = 10, 30, using the known theoretical 
distribution of Eqn. (1.4). Results of the test appear in Table 4.5. 
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Table 4.5 


Anderson-Darling Statistic for Simulated Distribution 
The values of W(N) obtained for both values of n are well 
below the 5% significance point for all cases of N. Hence, 
we may conclude that the simulated distribution has a good fit 
at the tail regions for all values of n > 6. 


Using Bahadur's theory [2] on sample quantiles described 
in Section 4.5, a comparison is made between the critical 
points of the simulated and theoretical distributions for 
ρ_1 ρ_2 = 0, as a final check on the accuracy of the simulation. 


The following notation (with reference to Section 4.5) is used : 


r_{T,q}(n,k), r_{S,q}(n,k) = q-critical values of the theoretical 
           and simulated distributions 

B_N      = number of observations in a sample of N r values 
           that are greater than r_{T,q}(n,k) 

Z_α      = α-critical value of the N(0,1) 
           distribution, that is, P[ Z > Z_α ] = α 

u_{N,q}   = | r_{T,q}(n,k) - r_{S,q}(n,k) | 

u_{N,q,α} = α-critical value of u_{N,q}, that is, 
           P[ u_{N,q} > u_{N,q,α} ] = α 


Rewriting Eqn. (4.14) in terms of the q-critical value of r, we 
have 


    r_{S,q}(n,k) = r_{T,q}(n,k) + (B_N - Nq) / [ N p_T(r_{T,q}; n, k) ] + R_N , 
                                                                 (4.22) 


where, with probability one, 

    R_N = O(N^{-3/4} log N) 

becomes negligible as N → ∞. 


Assuming R_N to be zero for large N, the remaining term of 
Eqn. (4.22) gives an estimate of the possible difference between the 
simulated and theoretical q-critical values. This term may 
be compared with the observed difference, 


    u_{N,q} = | r_{T,q}(n,k) - r_{S,q}(n,k) | , 

as a final check on the performance of the simulation. The 
theoretical and observed differences for the case ρ_1 ρ_2 = 0 
are shown in Table 4.6. 
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Table 4.6 


Theoretical and Observed Difference between Simulated and 
Theoretical Critical Values 


ρ_1 ρ_2 = 0     N = 7000 


Table 4.6 shows that the observed differences are less 
than the theoretical differences in most cases and do not differ 
too greatly from the theoretical values in others. This serves 
to indicate reasonably good accuracy in the simulated 
distribution. 
An error bound for the simulated q-critical value for a 
particular set of parameters (n, ρ_1 ρ_2) can be determined using 
the theory described in Section 4.5. It was shown that the α- 
critical value, u_{N,q,α}, of the difference between the 
theoretical and simulated critical value, u_{N,q}, is given by 


    u_{N,q,α} = Z_α (pq)^{1/2} / [ N^{1/2} p_T(r_{T,q}; n, ρ_1 ρ_2) ] ,    (4.23) 


where p = 1 - q and Z_α is the α-critical value of the 
N(0,1) distribution (as defined in the previous section). The 
values of u_{N,q,α} for ρ_1 ρ_2 = 0 and various values of n, q 
and α appear in Table 4.7. 


Comparison of the observed differences, u_{N,q}, in column 
5 of Table 4.6 with the critical values of Table 4.7 shows 
that in all cases the observed values of u_{N,q} are much less 
than the u_{N,q,.01} values. Hence, we can, for example, say with 
99% probability that the error in the simulated .01-critical 
value for N = 7000, n = 30 is not more than u_{N,q,.01} = .0172. 
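
The bound of Eqn. (4.23) can be evaluated numerically as in the sketch below. It assumes Z_α is the upper-tail critical value of the standard normal distribution (P[Z > Z_α] = α); the function name and the density value 0.3 used in the example are hypothetical and are not taken from Table 4.7.

```python
import math
from statistics import NormalDist


def quantile_error_bound(n_sim, q, alpha, density_at_quantile):
    """u_{N,q,alpha} = Z_alpha * sqrt(p*q) / (sqrt(N) * density), p = 1 - q.

    Z_alpha is taken as the upper-tail normal critical value (an assumption).
    """
    z_alpha = NormalDist().inv_cdf(1.0 - alpha)
    p = 1.0 - q
    return z_alpha * math.sqrt(p * q) / (math.sqrt(n_sim) * density_at_quantile)


# Hypothetical density value at the critical point, for illustration only.
bound_7000 = quantile_error_bound(7000, q=0.01, alpha=0.01, density_at_quantile=0.3)
bound_28000 = quantile_error_bound(28000, q=0.01, alpha=0.01, density_at_quantile=0.3)
print(bound_7000, bound_28000)
```

Because the bound scales as N^{-1/2}, quadrupling the number of simulations only halves it.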
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Significance Level 
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CHAPTER V 


DETERMINATION OF THE ACCURACY OF THE APPROXIMATE DISTRIBUTION 


AND ITS CRITICAL VALUES. 
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One of the most useful features of the Kolmogorov-Smirnov 
test is that the critical values of the test statistic, D(N) 
(Eqn. (4.3)) may be used to set a confidence band for an unknown 


continuous distribution function. 


Let F(x)   be the true unknown distribution function, 
    S_N(x) be the sample distribution function based 
           on a sample drawn from F(x), 


    D_α(N) be the α-critical value of D(N), 
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Thus for any unknown distribution function F(x), a random 
sample could be drawn from the distribution and a confidence 
band of width ± D_α(N) set up around the sample distribution 
function S_N(x), so that with probability 1 - α the true 
F(x) lies entirely within this band. This is a simple and 
direct method of estimating a distribution function. 
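
As an illustration of the confidence band idea, the sketch below builds the band around the sample distribution function of a small sample. The function name, the four-point sample and the critical value 0.1 are made up for the example; in practice the critical value would come from a Kolmogorov-Smirnov table.

```python
def ks_confidence_band(sample, critical_value):
    """Return (x, lower, upper) triples around the sample distribution function.

    The band is S_N(x) +/- critical_value, clipped to the interval [0, 1].
    """
    xs = sorted(sample)
    n = len(xs)
    band = []
    for i, x in enumerate(xs, start=1):
        s = i / n  # value of the sample distribution function at x
        band.append((x, max(0.0, s - critical_value), min(1.0, s + critical_value)))
    return band


# Hypothetical sample and critical value, for illustration only.
band = ks_confidence_band([0.3, 0.1, 0.2, 0.4], critical_value=0.1)
for x, lower, upper in band:
    print(x, round(lower, 3), round(upper, 3))
```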


We shall now apply this method to compare the approximate 
and simulated distributions for a value of ρ_1 ρ_2 ≠ 0, and 
estimate the error in the approximate distribution. The 
procedure is described in the following section. 


5.2 Determination of Accuracy of Approximate Distribution. 
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By simulating the distribution of the cross-correlation, 
r_xy, for the case ρ_1 ρ_2 = 0 and comparing it to the known 
theoretical distribution of Eqn. (1.4) using the Kolmogorov- 
Smirnov criterion, we estimated the error in simulation to be at 
most .02 over the entire distribution. Since this error will 
mainly be due to sampling fluctuations (that is, variations in 
simulation results which are dependent on the size and number of 
samples of r taken) we shall assume that the same error bound 
holds in the simulation of p(r) for non-zero values of ρ_1 ρ_2. 
That is, the error in the simulated distribution with 
non-zero ρ_1 ρ_2 values is within ±.02 of the true distribution 
of r for ρ_1 ρ_2 = k with probability .99. This assumption can be 
justified by showing that the assumption holds true for some 
known distribution of a form similar to p(r) and simulated by 
means of the same random number generator. The justification of 
this assumption by showing that it holds true for the normal 
distribution is given in Appendix B. We now proceed to make an 
estimate of the accuracy of the approximate distribution, 
p*(r), given by Eqn. (1.5), using the confidence band technique 
described in the previous section. 


Having estimated the error in simulation to be at 
most .02 for sample sizes of 7000, we now simulate the 
distribution of r for ρ_1 ρ_2 = k, k ≠ 0, for a given 
value of n, using this sample size. We then set up a confidence 
band of width ±.02 around the simulated cumulative 
distribution F_S(r; n, k) (refer to Section 4.6 for notation). We 
can say with 99% certainty that the true cdf F_T(r; n, k) will 
lie within this confidence band, that is, within ±.02 of the 
simulated distribution F_S(r; n, k). The resulting probability 
statement, for the given values of n and k, and all values of r, 
is P[ |F_S(r; n, k) - F_T(r; n, k)| ≤ .02 ] = .99. 
Next, we evaluate the approximate cdf, F_A(r; n, k), using the 
approximate density function p*(r) of Eqn. (1.5) with 
ρ_1 ρ_2 = k, and compute the maximum deviation 
between F_A(r; n, k) and F_S(r; n, k), which is given by 


    D_{SA}(N; n, k) = max_r | F_A(r; n, k) - F_S(r; n, k) | .    (5.4) 


If D_{SA}(N; n, k) < .02, F_A(r; n, k) lies within the 
.02 confidence band of F_S(r; n, k). The deviation between the 
approximate and true distribution can then be at most ±.04, 
that is, 

    | F_A(r; n, k) - F_T(r; n, k) | < .04 . 


Thus we can conclude with 99% certainty that the error in the 
approximate distribution is less than .04. 


If D_{SA}(N; n, k) ≥ .02, F_A(r; n, k) falls outside the 
confidence band and we may say that the error in the approximate 
distribution could be greater than .04. 
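
The decision rule just described can be sketched as follows. The cdf values in the example are hypothetical (they are not taken from the tables), both cdfs are assumed to be evaluated at the same grid of r values, and the function names are illustrative.

```python
def max_deviation(approx_cdf, simulated_cdf):
    """Largest absolute difference between two cdfs on a common grid."""
    return max(abs(a - s) for a, s in zip(approx_cdf, simulated_cdf))


def check_approximation(approx_cdf, simulated_cdf, band=0.02):
    """Return (D, within_band): the deviation and whether D < band.

    When within_band is True the approximate cdf is at most 2 * band from
    the true cdf (at the confidence level of the band).
    """
    d = max_deviation(approx_cdf, simulated_cdf)
    return d, d < band


# Hypothetical cdf values on a common grid, for illustration only.
approx = [0.010, 0.050, 0.200, 0.500, 0.800]
simulated = [0.012, 0.060, 0.215, 0.505, 0.790]
deviation, within_band = check_approximation(approx, simulated)
print(round(deviation, 4), within_band)
```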


The above test was performed on the approximate 
distribution for the following values of the parameters : 
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a = aia = 9 p ae Ss aetegt 4 n = 30 
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The results of the test are tabulated in Table 5.1. Tables 5.2 
- 5.6 show the frequency distributions of the simulated r values 
for the parameters (n, ρ_1 ρ_2) = (10, .1), (30, .1), (30, [illegible]), 
(30, -.1), (30, -.49). Figs. 5.1 - 5.9 illustrate the fit between 
the simulated and approximate distributions, for all the 
parameters considered. Tables 5.7 - 5.9 compare the moments, 
skewness and kurtosis of the approximate distributions with the 
corresponding values of the simulated distributions. 


Table 5.1 


Maximum Deviation between Approximate and Simulated 
Distributions 
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Simulated and Approximate Distributions of Cross-Correlation 


N = 7000     n = 10     ρ_1 ρ_2 = 0.1 

Range Freq. Gen.P(r) Gen.CDF Appr.P(r) Appr.CDF 
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Table 5.3 


Simulated and Approximate Distributions of Cross-Correlation 


N = 7000     n = 30     ρ_1 ρ_2 = 0.1 

    Range        Freq.  Gen.P(r)  Gen.CDF  Appr.P(r)  Appr.CDF 
-1.00  -0.95        0    0.0000   0.0000    0.0000    0.0000 
-0.95  -0.90        0    0.0000   0.0000    0.0000    0.0000 
-0.90  -0.85        0    0.0000   0.0000    0.0000    0.0000 
-0.85  -0.80        0    0.0000   0.0000    0.0000    0.0000 
-0.80  -0.75        0    0.0000   0.0000    0.0001    0.0000 
-0.75  -0.70        0    0.0000   0.0000    0.0004    0.0000 
-0.70  -0.65        1    0.0029   0.0001    0.0022    0.0001 
-0.65  -0.60        2    0.0057   0.0004    0.0080    0.0003 
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Table 5.4 


Simulated and Approximate Distributions of Cross-Correlation 


N = 7000     n = 30     ρ_1 ρ_2 = 0.42 

    Range        Freq.  Gen.P(r)  Gen.CDF  Appr.P(r)  Appr.CDF 
-1.00  -0.95        0    0.0000   0.0000    0.0000    0.0000 
-0.95  -0.90        0    0.0000   0.0000    0.0000    0.0000 
-0.90  -0.85        0    0.0000   0.0000    0.0002    0.0000 
-0.85  -0.80        1    0.0029   0.0001    0.0025    0.0001 
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Simulated and Approximate Distributions of Cross-Correlation 


N = 7000     n = 30     ρ_1 ρ_2 = -.49 

    Range        Freq.  Gen.P(r)  Gen.CDF  Appr.P(r)  Appr.CDF 
-1.00  -0.95        0    0.0000   0.0000    0.0000    0.0000 
-0.95  -0.90        0    0.0000   0.0000    0.0000    0.0000 
-0.90  -0.85        0    0.0000   0.0000    0.0000    0.0000 
-0.85  -0.80        0    0.0000   0.0000    0.0000    0.0000 
-0.80  -0.75        0    0.0000   0.0000    0.0000    0.0000 
-0.75  -0.70        0    0.0000   0.0000    0.0000    0.0000 
-0.70  -0.65        0    0.0000   0.0000    0.0000    0.0000 
-0.65  -0.60        0    0.0000   0.0000    0.0000    0.0000 
-0.60  -0.55        0    0.0000   0.0000    0.0004    0.0000 
-0.55  -0.50        1    0.0029   0.0001    0.0016    0.0001 
-0.50  -0.45        1    0.0029   0.0003    0.0057    0.0002 
-0.45  -0.40        5    0.0143   0.0010    [illegible]  0.0007 
-0.40  -0.35       29    0.0829   0.0051    0.0510    0.0024 
-0.35  -0.30       44    0.1257   0.0114    0.1341    0.0066 
-0.30  -0.25       88    0.2514   0.0240    0.3047    0.0170 
-0.25  -0.20      238    0.6800   0.0580    0.6350    0.0397 
-0.20  -0.15      432    1.2340   0.1197    1.1740    0.0840 
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0 0.0000 1.0000 0.0000 1.0000 
0 0.0000 1.0000 0.0000 1.0000 
0 0.0000 1.0000 0.0000 1.0000 
Table 5.6 


Simulated and Approximate Distributions of Cross-Correlation 


N = 7000     n = 30     ρ_1 ρ_2 = [illegible] 

    Range        Freq.  Gen.P(r)  Gen.CDF  Appr.P(r)  Appr.CDF 
-1.00  -0.95        0    0.0000   0.0000    0.0000    0.0000 
-0.95  -0.90        0    0.0000   0.0000    0.0000    0.0000 
-0.90  -0.85        0    0.0000   0.0000    0.0000    0.0000 
-0.85  -0.80        0    0.0000   0.0000    0.0000    0.0000 
-0.40  -0.35       88    0.2514   0.0214    0.2109    0.0135 
-0.35  -0.30      130    0.3714   0.0400    0.3793    0.0286 
 0.70   0.75        0    0.0000   1.0000    0.0000    1.0000 
 0.75   0.80        0    0.0000   1.0000    0.0000    1.0000 
 0.80   0.85        0    0.0000   1.0000    0.0000    1.0000 
 0.85   0.90        0    0.0000   1.0000    0.0000    1.0000 
 0.90   0.95        0    0.0000   1.0000    0.0000    1.0000 
 0.95   1.00        0    0.0000   1.0000    0.0000    1.0000 
Moments, Skewness, Kurtosis of Simulated and Approximate 
Distribution. 


N = 7000 


[Table body illegible in source. Surviving row labels: Mean, 
Variance, Third Moment, Fourth Moment, Skewness, Kurtosis. 
Surviving entries: 0.1213, 0.0000, 0.0349, 0.0000, 2.3800, 
2.4038, 2.7480.] 


Table 5.8 
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Table 5.9 


Moments, Skewness, Kurtosis of Simulated and Approximate 
Distribution. 


n = 30     N = 7000 


The values of D_{SA}(N) obtained for ρ_1 ρ_2 = .1, .49 and 
n = 10, 30 are well below the .01 significance point, D_{.01}(7000) = 
.0172 (and therefore less than .02). Hence, we can say with 99% 
confidence that the approximate distribution is accurate to 
within ±.04 of the true distribution for these values of the 
parameters. However, for higher values of ρ_1 ρ_2 the results are 
not as satisfactory. For ρ_1 ρ_2 = .72, .81, n = 30, the values 
of D_{SA}(N) exceed .02, indicating a possible error in the 
approximate distribution greater than .04. A comparison of the 
simulated and approximate distributions in Fig. 5.7 and the 
values of the moments in Table 5.9 indicates that the variance of the 
approximate distribution for high values of ρ_1 ρ_2 and small n 
(n = 30) is too large. Fig. 5.7 shows that the simulated 
distribution has a narrower spread and higher peak than the 
approximate distribution. The difference between the two 
distributions is less severe for the lower value of ρ_1 ρ_2 
(ρ_1 ρ_2 = .72). From these observations it would appear that for 
high values of the autocorrelations, a more accurate 
distribution may be obtained only with larger values of n, since 
the variance of the approximate distribution decreases with 
increasing n. This can be observed from the graphs in Fig. 1.1. 
The approximate distribution for ρ_1 ρ_2 = .72 has a variance of 
.0900 for n = 50 as compared to a variance of .1925 for n = 30. 
Comparing the simulated and approximate distribution for 
negative products of the serial correlations (that is, for ρ_1 ρ_2 < 0), 
it can be seen from Table 5.1 and Figs. 5.4, 5.5 that the 
approximate distribution closely resembles the simulated one for 
ρ_1 ρ_2 between -.5 and 0. However, for ρ_1 ρ_2 < -.5, 
the approximate distribution again seems to have a larger 
variance than the simulated distribution, where the simulated 
distribution (that is, the empirical frequency distribution) 
represents the true (unknown) distribution of the sample 
cross-correlation r_xy. Thus the approximate distribution not 
only has larger variance for large positive ρ_1 ρ_2 values but also 
indicates an increase in variance for decreasing values of 
ρ_1 ρ_2 < -.50, which does not agree with the behaviour of the 
simulated distribution. 
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In order to illustrate this discrepancy in the two 
distributions the variances for both the approximate and the 
simulated distribution were plotted for n = 30 in Fig. 5.8. 
The corresponding values of the variances are listed in Table 
5.10. In this case, n = 30, the variance for the 
approximate distribution appears to have a minimum close to 
ρ_1 ρ_2 = -.6 whereas the simulated variance tends to zero as 
ρ_1 ρ_2 tends to -1. 
The primary implication of the results given in this 
section appears to be the following. The approximate 
distribution for values of -.5 ≤ ρ_1 ρ_2 ≤ .5 and n ≤ 30 is 
accurate to within ±.04 of the true distribution of cross- 
correlation with 99% probability. For high values of ρ_1 ρ_2 and 
the same values of n, the approximate distribution appears to 
have too large a variance. The error in the distribution in this 
case could be greater than .04. For larger values of n, 
however, the approximate distribution for high values of ρ_1 ρ_2 
could be a more accurate representation of the true 
distribution, since the variance of the distribution decreases 
with increasing n. 


To obtain a more definite pattern of behaviour of the 
approximate distribution with respect to the parameters n and 
ρ_1 ρ_2, more thorough testing would have to be performed on the 
distribution for various combinations of the parameters. Due to 
the high computer cost involved in simulating and testing each 
distribution (see Table C in Appendix C) we have limited the 
testing of the approximate distribution to the few values of the 
parameters considered above, and observations on the behaviour 
of this distribution have been made based on the results of 


these tests. 


Since the degree of accuracy in the approximate 
distribution for a particular value of ρ_1 ρ_2 varies with n, it 
is possible to test for a value of n which will give a desired 
precision in the approximate distribution for that value of 
ρ_1 ρ_2. For practical purposes, however, it would be useful to 
establish for each value of ρ_1 ρ_2 a minimum value of n with 
which the approximate distribution may be applied with 
reasonable accuracy (since in practice, series of data 
encountered are usually small). In the next section we shall 
apply the confidence band technique to estimate a minimum value 
of n for a particular value of ρ_1 ρ_2. 


Since the approximate density function as given by 
Eqn. (1.5) is dependent only on the sample size n and the 
product of the autocorrelations ρ_1 ρ_2, it is useful to be able 
to establish a minimum value of n, say n_a, for which the 
approximate distribution for a particular value of ρ_1 ρ_2 can be 
used with reasonable accuracy. 


Let ε be the desired accuracy we wish to have in the 
approximate distribution. Let N_ε be the sample size which will 
give a simulated distribution within ±ε/2 of the true 
distribution with 99% probability. The algorithm for 
estimating n_a, given a value of ρ_1 ρ_2 = k, may be summarized in 
the following steps : 

1) For an initial value of n, evaluate F_A(r; n, k) using 
   Eqn. (1.5). 


2) Simulate the distribution F_S(r; n, k) using a sample of 
   size N_ε. 


3) Compute D_{SA}(N_ε; n, k) by Eqn. (5.4). 


4) If D_{SA}(N_ε; n, k) < ε/2, put n_a = n and stop. 
   If D_{SA}(N_ε; n, k) ≥ ε/2, put n = n + 1 and go to 1). 
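
The steps above can be sketched as a simple search loop. This is a schematic outline only: the two cdf callables are placeholders supplied by the caller (the approximate cdf of Eqn. (1.5) and the simulated cdf are not implemented here), and the toy functions in the example exist purely to exercise the loop.

```python
def minimum_series_length(approx_cdf, simulated_cdf, k, epsilon,
                          n_start, n_max, grid):
    """Smallest n in [n_start, n_max] with max deviation below epsilon / 2.

    approx_cdf(r, n, k) and simulated_cdf(r, n, k) are caller-supplied
    callables (hypothetical interfaces); grid is the set of r values at
    which the two cdfs are compared. Returns None if no n qualifies.
    """
    n = n_start
    while n <= n_max:
        deviation = max(abs(approx_cdf(r, n, k) - simulated_cdf(r, n, k))
                        for r in grid)
        if deviation < epsilon / 2:
            return n
        n += 1
    return None


# Toy stand-ins (NOT the thesis distributions): the deviation is 0.11 / n.
def toy_simulated_cdf(r, n, k):
    return (r + 1.0) / 2.0


def toy_approx_cdf(r, n, k):
    return toy_simulated_cdf(r, n, k) + 0.11 / n


grid = [i / 10.0 - 1.0 for i in range(21)]
n_a = minimum_series_length(toy_approx_cdf, toy_simulated_cdf,
                            k=0.1, epsilon=0.04, n_start=2, n_max=50, grid=grid)
print(n_a)
```

In a real run each pass through the loop requires a fresh simulation of size N_ε, which is the expensive step.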


Taking ε = .04, N_ε = 7000, the value of n_a estimated 
by the above algorithm was 6 for ρ_1 ρ_2 = .1 and 10 for 
ρ_1 ρ_2 = -.49. The above algorithm can be used to determine the 
smallest sample size that can be used with each value of 
ρ_1 ρ_2 in |ρ_1 ρ_2| < 1. However, from the discussion in Section 5.2 the 
range of ρ_1 ρ_2 values for which these calculations should be 
5.4 Critical Values of the Approximate Distribution. 


Approximate critical values of r for ρ_1 ρ_2 ≠ 0 can be 
determined using the approximate density function of Eqn. (1.5). 
It is, however, necessary to estimate an error bound for these 
approximate critical values. The procedure used is based on the 


theory on error bounds described in Section 4.6. 


We first simulate the distribution of r for ρ_1 ρ_2 = k ≠ 0 
using a sample size of 7000 for the given value of n, and 
evaluate the q-critical value, r_{S,q}(n,k) (refer to Section 4.9). 
Then we can say with (1-α)100% certainty that 


    | r_{T,q}(n,k) - r_{S,q}(n,k) | < u_{N,q,α}                (5.5) 


where r_{T,q}(n,k) is unknown and u_{N,q,α} is the critical value of 
u_{N,q} given by Eqn. (4.23). 


Note from Eqn. (4.23) that, to compute u_{N,q,α} we require 
the value of the true density function at the critical point. 
However, since both these values are unknown and since we have 
shown that the simulated distribution has a good fit at the tail 
regions for ρ_1 ρ_2 in (-.5,.5), we shall replace p_T(r_{T,q}; n, k) by 
the simulated value, that is, we compute 


    u_{N,q,α} = Z_α (pq)^{1/2} / [ N^{1/2} p_S(r_{S,q}; n, ρ_1 ρ_2) ] .    (5.6) 
Next, we evaluate the approximate critical value, 
r_{A,q}(n,k), using the approximate density function of 
Eqn. (1.5), and then compute the absolute difference between the 
approximate and simulated critical values, 


    | r_{A,q}(n,k) - r_{S,q}(n,k) | .                          (5.7) 

If | r_{A,q}(n,k) - r_{S,q}(n,k) | < u_{N,q,α}, then we know that 

    | r_{A,q}(n,k) - r_{T,q}(n,k) | 
        = | r_{A,q}(n,k) - r_{S,q}(n,k) + r_{S,q}(n,k) - r_{T,q}(n,k) | 

        ≤ | r_{A,q}(n,k) - r_{S,q}(n,k) | 
          + | r_{S,q}(n,k) - r_{T,q}(n,k) | 

        < 2u_{N,q,α} .                                       (5.8) 

Hence, 2u_{N,q,α} can be used as an approximate error bound for 
the critical value computed from the approximate distribution. 
That is, we can say with (1-α)100% certainty that the true 
critical value is given by 

    r_{T,q}(n,k) = r_{A,q}(n,k) ± 2u_{N,q,α} .                  (5.9) 
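
The triangle-inequality argument above translates into a short check, sketched below. The function name and the two critical values 0.70 and 0.71 are hypothetical; the bound u = .0166 is chosen only so that 2u matches the ±.0332 quoted for the illustration in this section.

```python
def true_critical_value_interval(r_approx, r_simulated, u_bound):
    """Interval r_approx +/- 2*u_bound for the true critical value.

    Returns None when |r_approx - r_simulated| is not below u_bound,
    in which case the 2*u_bound error bound does not apply.
    """
    if abs(r_approx - r_simulated) >= u_bound:
        return None
    return (r_approx - 2.0 * u_bound, r_approx + 2.0 * u_bound)


# Hypothetical critical values, for illustration only.
interval = true_critical_value_interval(0.70, 0.71, 0.0166)
print(interval)
```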
To illustrate the above method, the critical values of the 
simulated and approximate distributions of r for ρ_1 ρ_2 = .1, 
several values of n, and N = 7000, and values of u_{N,q,α} for 
α = .01 are computed and shown in Table 5.11. In all cases, 
the difference (5.7) is less than the corresponding 
u_{N,q,.01}. Hence, we may conclude with 99% certainty that the 
true critical value for, say, q = .01 and n = 10 is given by 


    r_{T,.01}(10,.1) = r_{A,.01}(10,.1) ± .0332 

                    = [illegible] ± .0332. 


Critical values of the simulated and approximate distribution of 
r_xy for ρ_1 ρ_2 = -.49, -.1, .49, n = 30 and N = 7000 are shown in 
Table 5.12. 


Having devised methods of estimating the degree of accuracy 
obtainable with the approximate distribution for various 
combinations of the parameters (n, ρ_1 ρ_2) and approximating the 
error in the critical points computed from this distribution, it 
is now possible to apply the approximate distribution in tests 
of significance for correlation. Since statistical tests have 
indicated good accuracy in the approximate distribution for 
small n values and ρ_1 ρ_2 in (-.5,.5), it may be reasonable to use 
critical values, for these values of the parameters, evaluated 
from this distribution as the exact values, assuming the errors 
in them to be negligible. A table of critical values of r computed 
from the approximate distribution for various values of n and 
ρ_1 ρ_2 is given in Appendix Table D2. 
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Table 5.11 


Approximate and Simulated Critical Values of r_xy and the 
Corresponding u_{N,q,α} Values. 


N = 7000 , α = .01 


Table 5.12 


Approximate and Simulated Critical Values of r_xy 
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CHAPTER VI 


APPLICATION, CONCLUSION AND DISCUSSIONS. 


6.1 Application of the Approximate Distribution of Cross-Correlation. 


We shall present in this section an example to illustrate a 
practical application of the approximate distribution of the 
cross-correlation in economic time series analysis. The example 
shows that using the approximate distribution it is possible to 
test for correlation between two processes (that is, two series 
of data) without having to assume that their serial correlations 


are zero. 


The time series dealt with here are the forecast errors in 
the interest rates on the U.S. Federal Fund given in a paper by 
Janssen [16]. Forecast errors for Friday, Monday, Tuesday and 
Wednesday were observed over a period of 33 weeks. The following 
notation was used : 

    t   : time index; Thursday (t=1), Friday (t=2), 
          Monday (t=3), Tuesday (t=4), Wednesday (t=5) 


    v_t : forecast error on day t 


The forecast errors appear in Table 6.1. 
Table 6.1 


Forecast Errors for the Period 
(33 Weeks) 


4.1.68 - 30.10.68 


[Table body illegible in source] 


In his analysis, Janssen used the t-distribution to test 
the hypothesis that no correlation exists between forecast 
errors of two different days, that is, 


    ρ(v_t, v_s) = 0                                          (6.1) 


for t = 2, 3, 4 and s = 3, 4, 5, t < s. The t-test for 
correlation requires that at least one of the variables be 
normally distributed. This condition is satisfied in this case 
since the author has established, in an earlier part of his 
analysis, the distribution of the forecast errors to be 
approximately normal by hypothesis testing. The coefficients of 
correlation between each of the four series, and the associated 
t-values, as obtained by Janssen, are shown in Table 6.2. 
Critical values of the t-distribution and the associated 
critical correlation coefficients are : 


    t_{.90}  = 1.31   or   r_{v_t v_s, .10}  = .229 
    t_{.95}  = 1.70   or   r_{v_t v_s, .05}  = .292 
    t_{.995} = 2.75   or   r_{v_t v_s, .005} = .443 

where r_{v_t v_s, α} is the α-critical point of the sample cross- 
correlation r_{v_t v_s} (that is, P[ r_{v_t v_s} ≤ r_{v_t v_s, α} ] = 1 - α), and a 
value of r_{v_t v_s} is denoted by r(v_t, v_s). 

Janssen observed that the correlations between successive
days are large enough to cause rejection of the hypothesis
(6.1), that is, rejection of the assumption of independence
between these series of forecast errors. However, for longer
lags (that is, not adjacent days) the results give no reason
to reject the hypothesis.


Table 6.2 


Janssen's Observed Coefficients of Correlation, $r(V_t, V_s)$, and
Associated t-test Values between Series of Forecast Errors
(Days).


We shall perform here the test for correlation between each
of the four error series using the approximate distribution
(Eqn. (1.5)). To apply this distribution the autocorrelation in
each series must be known. The autocorrelation of lag 1 for each
of the error series can be estimated from the series of
observations in Table 6.1 using the formula (see Jenkins and
Watts [18]),

$$\hat{\rho}_j = \frac{\sum_{i=1}^{n-1} (V_{ji} - \bar{V}_j)(V_{j,i+1} - \bar{V}_j)}{\sum_{i=1}^{n} (V_{ji} - \bar{V}_j)^2}, \tag{6.2}$$

where $\bar{V}_j = \frac{1}{n}\sum_{i=1}^{n} V_{ji}$.
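As an illustration of the lag-1 estimate, the sketch below computes the ratio of the lag-1 sum of cross-products to the sum of squares, both taken about the overall series mean. It assumes this ratio form of the Jenkins and Watts estimator; the function name and the data are made up for the example and are not the values of Table 6.1.

```python
def lag1_autocorrelation(series):
    """Lag-1 sample autocorrelation, with both sums taken about the overall mean."""
    n = len(series)
    mean = sum(series) / n
    c0 = sum((v - mean) ** 2 for v in series)
    c1 = sum((series[i] - mean) * (series[i + 1] - mean) for i in range(n - 1))
    return c1 / c0

errors = [1.0, 2.0, 3.0, 4.0, 5.0]   # made-up values, not taken from Table 6.1
print(lag1_autocorrelation(errors))  # 0.4
```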


Estimates of the autocorrelation of lag 1 for each of the error
series are shown in Table 6.3. In order to be able to apply the
approximate distribution we shall assume that the true
autocorrelations (serial correlations) are given to be

Doe eo OS, ONG = oli) See, Drs =.0O32, Pe = —..0192 | 
where $\rho_j$ is the given serial correlation for the $j$th series of
forecast errors.


Table 6.3 


Estimates of Autocorrelation of Lag 1 
The sample cross-correlations between each of the four
error series are estimated by the following formula, using a
sample size of 33,

$$r(V_t, V_s) = \frac{\sum_{i=1}^{33} (V_{ti} - \bar{V}_t)(V_{si} - \bar{V}_s)}{\left[\sum_{i=1}^{33} (V_{ti} - \bar{V}_t)^2 \sum_{i=1}^{33} (V_{si} - \bar{V}_s)^2\right]^{1/2}} \tag{6.3}$$

for each pair of series, $t \neq s$. These estimates, which are the
same as those obtained by Janssen in Table 6.2, are shown in
Table 6.4.


Table 6.4 


Sample Cross-Correlation between Error Series 
To test the null hypothesis that $\rho_{V_tV_s} = 0$ for each pair
of error series, $t \neq s$, critical values of $r_{V_tV_s}$ must be
available. The relevant values of $\rho_t\rho_s$ are shown in
Table 6.5, where $\rho_t$ and $\rho_s$ are the specified values given
earlier. Using the approximate distribution in Eqn. (1.5), the
$\alpha$-critical values of $r_{V_tV_s}$ are computed for several
levels of $\alpha$ (.005, .01, .025 and others), with $n = 33$ and the
relevant $\rho_t\rho_s$ values, and are tabulated in Table 6.6.


Table 6.5 


Product of Autocorrelations of Lag 1, $\rho_t\rho_s$
Table 6.6


One-Tail Critical Values of $r_{V_tV_s;\alpha}$, $n = 33$


Critical values $r_{V_tV_s;\alpha}$ for $\alpha$ = .010, .025, .050


For $\rho_t\rho_s = .0549$ the observed cross-correlation
$r(V_t, V_s) = -.425$ is less than the .025-critical point
$-r_{V_tV_s;.025} = -.359$ for a two-tail test; hence, the null
hypothesis $\rho_{V_tV_s} = 0$ must be rejected at the .05 level.
This implies that the cross-correlation between the Friday and
Monday forecast errors has a negative value and is not zero;
hence, these two series of forecast errors are not independent.

Similarly, for the Tuesday and Wednesday series (with the
product $\rho_t\rho_s$ given in Table 6.5), $r(V_t, V_s) = .345$,
which exceeds $r_{V_tV_s;.025} = .344$. This implies that the
series of Wednesday forecast errors depends on the Tuesday
forecast errors. The other two adjacent days which indicate
dependence in their forecast error series are Monday and
Tuesday. Comparing the Friday forecast error series with those
of Tuesday, $r(V_t, V_s) = .131 < r_{V_tV_s;\alpha/2}$, and those of
Wednesday, $r(V_t, V_s) = -.052 > -r_{V_tV_s;\alpha/2}$, their
cross-correlations are not significantly different from zero,
implying that these series are linearly independent. Similarly,
the cross-correlation between the Monday and Wednesday forecast
errors (see Table 6.4) is not significantly different from zero.


These results are similar to those obtained by Janssen. In
using the t-test Janssen made the assumption that the product of
the serial correlations is zero (that is, $\rho_t\rho_s = 0$), or in
other words that at least one of the series comes from a normal
distribution and, hence, that $r_{V_tV_s}$ had the Pearson
correlation coefficient distribution. In this particular example
of forecast errors the serial correlations were relatively small,
yielding even smaller products (see Table 6.5). Since these
products are all fairly close to zero, Janssen's assumption did
not result in an erroneous decision. Perhaps the Friday-Monday
relationship should be considered in more detail. For testing
(two-tail test) Janssen used the critical value

$$r_{V_tV_s;.025} = r_{.025}(n = 33,\ \rho_t\rho_s = 0) = .344,$$

whereas the critical value from the approximate distribution is

$$r_{V_tV_s;.025} = r_{.025}(n = 33,\ \rho_t\rho_s = .0549) = .359.$$


Thus even for a small serial correlation product, such as
$\rho_t\rho_s = .055$, there exists a noticeable difference in the
critical values. Suppose the observed cross-correlation
$r(V_t, V_s)$ had turned out to be -.350; then Janssen would have
concluded that the series are dependent, whereas the critical
points in Table 6.6 would have led to the conclusion of
independence.


The development in Chapter 5 showed that the critical
points of the approximate distribution closely represent the
true critical points of the true cross-correlation
distribution as long as the two series have small serial
correlations with the same or opposite sign. Thus for series
with such serial correlations the critical sample
cross-correlation values should be determined as shown in
Appendix A instead of just using the critical points of the
Pearson correlation coefficient distribution.


Emphasizing this point further, consider two series of size
$n = 30$ with a serial correlation product of $\rho_1\rho_2 = -.49$
(see the tables of Chapter 5). The $\alpha = .025$ critical value
found there differs from the critical point obtained under the
assumption $\rho_1\rho_2 = 0$, which is

$$r_{.025}(n = 30,\ \rho_1\rho_2 = 0) = .361.$$

Hence, using .361, the null hypothesis $\rho_{xy} = 0$ will not be
rejected when it actually should be (that is, when the series
are not independent). This could of course result in serious
errors in practical situations.
To test whether or not the correlation between two series
is significantly different from zero, the distribution of the
sample cross-correlation, $r$, and its critical values must be
available. McGregor and Bielenstein [30] have derived an
approximate null distribution (Eqn. (1.5)) of the
cross-correlation for two series of the linear, stationary
Markov type with known autocorrelations (serial correlations)
of lag 1, $\rho_1$ and $\rho_2$. This approximate distribution depends
only on the sample size, $n$, of the series and the product of
the autocorrelations, $\rho_1\rho_2$. For the case where
$\rho_1\rho_2 = 0$, this distribution reduces to the null
distribution of the Pearson correlation coefficient
(Eqn. (1.4)).

To compute critical values of $r$ from the approximate
distribution an algorithm has to be designed to evaluate the
approximate density function, $p^*(r; n, \rho_1\rho_2)$ (Eqn. (1.5)).
Several terms in the expression for this density function tend
to present overflow and underflow problems, which must be dealt
with in the algorithm (see Section A1 in Appendix A). The
Gauss-Legendre quadrature formula is used for the numerical
integration of the density function.


In order to apply the approximate distribution of
cross-correlation in any valid test for correlation it is
necessary to determine the accuracy of this distribution and its
critical values. The best approach to this problem is simulation,
since the Markov series required for investigation into the
problem are not readily available in a suitable form in practice.


A distribution of the cross-correlation is simulated by
taking $N$ observations of the sample cross-correlation between
samples of size $n$ of two series generated by two linear,
stationary Markov processes of the form

$$x_t = \rho_1 x_{t-1} + z_t,$$
$$y_t = \rho_2 y_{t-1} + z'_t,$$

where $\rho_1$, $\rho_2$ are known autocorrelations of lag one and
$z_t$, $z'_t$ are independent $N(0,1)$ random variables. To simulate
realizations of the Markov processes the main requirements are a
criterion to determine stationarity of the processes and a
'good' random number generator which will generate independent
normal random numbers satisfying various statistical criteria of
randomness and normality.
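The simulation scheme described above can be sketched as follows. This is only an illustration under stated assumptions: Python's built-in generator stands in for the generator of Chen [7], a fixed burn-in of 100 steps stands in for the stationarity criterion, and all function names are hypothetical.

```python
import math
import random

def markov_series(rho, n, rng, burn_in=100):
    """Generate n values of x_t = rho * x_{t-1} + z_t with z_t ~ N(0, 1).

    The first burn_in values are discarded so that the retained values are
    close to the stationary regime (an assumed, simple stationarity device).
    """
    x = 0.0
    out = []
    for t in range(burn_in + n):
        x = rho * x + rng.gauss(0.0, 1.0)
        if t >= burn_in:
            out.append(x)
    return out

def cross_correlation(x, y):
    """Sample cross-correlation (lag zero) between two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def simulate_r(rho1, rho2, n, N, seed=12345):
    """Return N simulated values of r for two independent Markov series."""
    rng = random.Random(seed)
    return [cross_correlation(markov_series(rho1, n, rng),
                              markov_series(rho2, n, rng))
            for _ in range(N)]

sample = simulate_r(0.5, 0.5, n=30, N=500)
print(min(sample), max(sample))
```

Sorting `sample` gives the empirical distribution of $r$ whose quantiles can be compared with the critical values of the approximate distribution.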


The process of searching for a random number generator for
the simulation is focused on its properties satisfying normality
and randomness, especially with respect to serial correlation,
since the simulated Markov series should be unrelated. It was
decided after careful consideration (see Section 3.6) that the
generator proposed by Chen [7] would be most suitable for this
simulation.


Before applying the simulated distribution, it is necessary
to determine how closely it represents the true distribution of
cross-correlations. The Kolmogorov-Smirnov and Anderson-Darling
tests are used to determine the goodness-of-fit of the simulated
distribution for the case $\rho_1\rho_2 = 0$, using the null
distribution of the Pearson correlation coefficient (Eqn. (1.4))
as the hypothesized true distribution. Using the
Kolmogorov-Smirnov criterion, the error in simulation is
estimated to be at most ±.02 over the entire cumulative
distribution of cross-correlation, with 99% probability for a
sample of size $N = 7000$. Results of the Anderson-Darling test
show that there is a good fit of the simulated to the theoretical
distribution in the tail regions. A comparison between the
simulated and theoretical critical values of $r$ using Bahadur's
theory on sample quantiles (see Section 4.8) also indicates
reasonably good accuracy in the simulation.
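The ±.02 figure can be related to the large-sample Kolmogorov-Smirnov bound $1.63N^{-1/2}$ at the .01 level (last row of Table D1). The sketch below computes the statistic $D(N)$ for a simulated sample and compares it with that bound. A standard normal sample is used as a stand-in for the simulated cross-correlations, and the function names are illustrative.

```python
import math
import random

def ks_statistic(sample, cdf):
    """Kolmogorov-Smirnov statistic D(N): largest gap between the
    empirical cdf of the sample and the hypothesized cdf."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        d = max(d, (i + 1) / n - f, f - i / n)
    return d

def normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def ks_band_99(n):
    """Large-sample 99% bound 1.63 / sqrt(N)."""
    return 1.63 / math.sqrt(n)

rng = random.Random(2024)
N = 7000
draws = [rng.gauss(0.0, 1.0) for _ in range(N)]
print(ks_statistic(draws, normal_cdf), ks_band_99(N))

# Sample size at which the same bound shrinks to .01
print((1.63 / 0.01) ** 2)
```

For $N = 7000$ the bound is about .0195, which is the origin of the ±.02 figure. Halving the error to .01 requires roughly four times as many observations, which is consistent with a sample size above 25,000.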


To determine the accuracy of the approximate distribution
(Eqn. (1.5)) for $\rho_1\rho_2 \neq 0$, a comparison is made between
this distribution and the corresponding simulated distribution,
using the confidence band technique furnished by the
Kolmogorov-Smirnov test. The error in the simulated distribution
for $\rho_1\rho_2 \neq 0$ is assumed to be ±.02. It is estimated that
the error in the approximate distribution for low values of
$|\rho_1\rho_2|$ and $n < 30$ is less than ±.04 with 99% probability.
For high values of $|\rho_1\rho_2|$ and small $n$ ($n = 30$) the
approximate distribution indicates a larger variance than the
true distribution, hence resulting in an error that could exceed
.04. The results of tests performed on the approximate
distribution seem to offer some evidence that for high values of
$|\rho_1\rho_2|$ the approximate distribution is accurate only for
large values of $n$. However, to be able to draw more definite
conclusions on the behaviour of the approximate distribution
with respect to $n$ and $\rho_1\rho_2$, more thorough testing would
have to be performed on the distribution for a wider range of
values of the parameters.


The confidence band technique is also used to establish a
minimum sample size $n$ with which the approximate distribution
for a particular value of $\rho_1\rho_2$ may be applied with
reasonable accuracy.

Using Bahadur's theory on sample quantiles, an error bound can
be estimated for each $\alpha$-critical value,
$r_\alpha(n, \rho_1\rho_2)$, of the approximate distribution for
values of $\rho_1\rho_2 \neq 0$. The true critical value is then
given by the approximate critical value adjusted by this error
bound.

This paper serves to illustrate how simulation may be used
to study a theoretical approximation to an unknown distribution.
Simulating realizations of the true unknown distribution of
$r_{xy}$ has added a large amount of information on the 'goodness'
of the approximate distribution developed by McGregor and
Bielenstein [30]. In studies where data of a particular class are
not available, or where analytical or mathematical methods cannot
be applied, simulation is perhaps the best approach to solving
the problem. In practice, simulation methods have found wide
applications only on powerful computing machines. The properties
of these methods make them peculiarly suitable for realization
on digital computers. Usually simulation methods are also highly
dependent on the availability of a 'good' random number
generator for generating a large quantity of random numbers on
the computer. Hence, they are expensive in terms of computer
cost since random number generation by computer is costly (see
Table C). Validation of the simulation results also demands
careful choice and development of methods of testing the results
to determine their representational value and usefulness, and of
estimating the error in simulation.


In this thesis simulation was used to obtain more detailed
information on the distribution of the sample cross-correlation
$r_{xy}$ between two stationary, linear Markov series for both the
true unknown distribution and the approximate density function
given in [30].
In our study of the cross-correlation distribution, we
obtained, using a sample of 7000 simulated values of $r$, an error
of 2% over the entire simulated distribution. To obtain a
reduction in this error would require a significant increase in
the sample size $N$ of $r$ and hence in the amount of computation.
The rate of convergence of the error in the ordinary simulation
method is not high and is dependent on $1/N^{1/2}$ (see Shreider
[34] and Table D1). Hence, to obtain an error of, say, 1% over
the entire simulated distribution a sample size > 25,000 would
have to be used. This approach is expensive in terms of computer
cost and it is obvious that very significant improvement of the
accuracy cannot be gained by this means. This observation, and
the fact that the high computer cost involved in simulation has
restricted our testing of the approximate distribution to only a
few values of the parameters in Section 5.2, are quite
indicative of the inefficiency and impracticability of simple
simulation methods as a means of solving distribution theory
problems when the entire distribution is required with great
accuracy. The ordinary simulation method cannot give any
solution of very high accuracy unless special techniques are
employed to improve the results. Therefore alternative
approaches to improve the accuracy of simulation and
modifications of computer simulation methods to increase
efficiency are worth exploring. Several methods, commonly known
as variance-reduction techniques, have been proposed for
reducing the sampling variability of simulation to increase
efficiency. Variance-reduction techniques are important in
simulation studies since they enable one to improve the accuracy
of estimates without increasing the number of tests (or
simulation runs). They are, in general, directed towards
altering the probability structure of the simulation model so
that efficiencies in computation are obtained. Some
variance-reduction techniques which have been suggested are:

1) Stratified sampling 

2) Systematic sampling 

3) Importance sampling 

4) Correlated sampling 

5) Regression methods 

6) Use of expected values 

7) Russian roulette and splitting 

8) Orthonormal functions 

9) Antithetic variates 

10) Control and Concomitant variates 
Kahn and Hammersley [13,19,20] offer two rather complete
discussions of variance-reduction techniques. A number of the
above-mentioned methods were described and discussed in a recent
paper by Gaver [11].
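To give a concrete feel for one item in the list, the sketch below applies antithetic variates (technique 9) to a toy problem, the estimation of $E[e^U]$ for $U$ uniform on (0, 1). This is a generic illustration and not a method used in the thesis; the integrand, the function names and the run sizes are all assumptions chosen for the example. Both estimators use the same number of function evaluations per estimate.

```python
import math
import random
import statistics

def plain_estimates(n_pairs, reps, rng):
    """Each estimate averages exp(U) over 2 * n_pairs independent uniforms."""
    out = []
    for _ in range(reps):
        total = sum(math.exp(rng.random()) for _ in range(2 * n_pairs))
        out.append(total / (2 * n_pairs))
    return out

def antithetic_estimates(n_pairs, reps, rng):
    """Each estimate averages exp(U) and exp(1 - U) over n_pairs uniforms."""
    out = []
    for _ in range(reps):
        total = 0.0
        for _ in range(n_pairs):
            u = rng.random()
            total += math.exp(u) + math.exp(1.0 - u)
        out.append(total / (2 * n_pairs))
    return out

rng = random.Random(7)
plain = plain_estimates(50, 400, rng)
anti = antithetic_estimates(50, 400, rng)
print(statistics.variance(plain), statistics.variance(anti))
```

Because $e^U$ and $e^{1-U}$ are negatively correlated, the antithetic estimates scatter much less around $e - 1$ than the plain ones for the same amount of computation.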


It may be possible and worthwhile in future research to
develop some of the variance-reduction techniques for
application in the present simulation study to obtain more
reliable and accurate results, on the basis of which some
concrete judgement may be formed on the overall validity of the
approximate distribution.


Our study shows that comparison of the simulated
distribution with the true or approximate distribution by the
Kolmogorov-Smirnov test is expensive in computer time since the
test procedure is laborious, involving at each generation of the
$r$ value, the non-decreasing ordering of the $r$ sequence and
numerical integration of the $p^*(r)$ function. Furthermore, this
test measures the deviation around the region of maximum
discrepancy only and is not sensitive to discrepancies in the
important tail regions, which may be small but are significant.
This indicates the need for more efficient and reliable methods
of comparing and testing simulation results, and hence the scope
for further research.


While the approximate distribution of cross-correlation may
be used to test for correlation between autocorrelated series,
its application is limited since it is completely dependent on
knowledge of the true autocorrelations, which are usually not
obtainable in practice. Furthermore, the results of our
simulation study seem to indicate that for high autocorrelations
of the same or opposite sign the approximate distribution is
accurate only for large $n$ (that is, longer series). This imposes
another restriction on the applicability of the approximate
distribution since in practice series of data available for
testing are usually small. Hence, it would be useful if some
distribution of the cross-correlation coefficient could be
developed in the future for application with estimated values of
the autocorrelations and small samples of data, even when the
absolute value of the serial correlation product is large.


Other methods of studying the cross-correlation between two 
series are also worth exploring. Spectral analysis is one 
approach that may be taken. Since the autocorrelation function 
and the spectrum (and the cross-correlation and the cross- 
spectrum) are Fourier transforms of each other, they are 
mathematically equivalent and therefore have equal 
representational value. Hence, the spectrum may be used in place 
of the autocorrelation as a tool in building simulation models 
and spectral analysis of the results be applied. However, the 
choice between the spectrum and the autocorrelation as a tool in 
model building depends upon the nature of the models which turn 
out to be more useful and efficient. It would be worthwhile to 
conduct further research into the possibility of applying 
spectral analysis to multivariate time series in order to derive 
a distribution of the cross-correlation which is less restricted
in application than the approximate distribution that is being
studied.
APPENDIX A


ALGORITHM FOR COMPUTING CRITICAL POINTS FOR SAMPLE
CROSS-CORRELATION


A1 Evaluation of $p^*(r; n, \rho_1\rho_2)$


Since the approximate density function of $r_{xy}$ (Eqn. (1.5)) is
an even function of $r$, it is symmetrical about $r = 0$. Hence,
only positive (or upper) critical points for $r_{xy}$ need to be
considered.

Let $p^*(r; n, \rho_1\rho_2)$ denote the approximate density function
of $r_{xy}$ for a particular set of values of the parameters
$(n, \rho_1\rho_2)$. Let $r_{\alpha;n,\rho_1\rho_2}$ be the upper critical
point for $r_{xy}$ at the $\alpha$-level of significance for that
particular set of parameters. $r_{\alpha;n,\rho_1\rho_2}$ may be
determined from the relation

$$\int_0^{r_{\alpha;n,\rho_1\rho_2}} p^*(r; n, \rho_1\rho_2)\,dr = 0.5 - \alpha. \tag{A.1}$$
0 


Computation of critical points by the above equation involves
the numerical integration and evaluation of $p^*(r; n, \rho_1\rho_2)$
for ranges of $r$-values. A Fortran program is designed to
accomplish these two tasks. For numerical integration, the
Gauss-Legendre quadrature formula is employed. A detailed
description of this method of integration is given in Krylov
[24] and Shroud [35].

The evaluation of $p^*(r; n, \rho_1\rho_2)$ by computer programming
presents both overflow and underflow problems in various terms
of the expression in Eqn. (1.5). The following section will
discuss the difficulties encountered and methods of avoiding
them.


Consider the following terms in the expression of Eqn.
(1.5) for the approximate density function,
$p^*(r; n, \rho_1\rho_2)$:
M-3 
K = 2 (1 Poe oa) 474 (A. 2) 
BI$M- 1,4) 
and 
$$D = K(1 - r^2)^{(M-4)/2}, \tag{A.3}$$
where 
$$B[\tfrac{1}{2}M - 1, \tfrac{1}{2}] = \frac{\Gamma(\tfrac{1}{2}M - 1)\,\Gamma(\tfrac{1}{2})}{\Gamma(\tfrac{1}{2}M - \tfrac{1}{2})}, \tag{A.4}$$
Cc = Cl + See ono ee jiv2 e (Ad) 
$$M = n + \frac{\rho_1\rho_2(6 - 5\rho_1\rho_2)}{1 - (\rho_1\rho_2)^2}. \tag{A.6}$$
Then Eqn. (1.5) may be written as
*x 
(r3n = pD[c + (1 + 1/2 C : A.T 
P ’ pe.) [ ( PsP. ra / ( ) 


From Eqn. (A.6) it can be seen that $M$ is large for large
values of $n$ with absolute values of $\rho_1\rho_2$ close to zero.
That is, $M \to \infty$ as $n \to \infty$ with $|\rho_1\rho_2| \to 0$.
For small values of $n$, $M$ becomes negative for values of
$\rho_1\rho_2$ close to -1. That is, for small $n$,
$M \to -\infty$ as $\rho_1\rho_2 \to -1$.
Fig. A1 shows the variation of $M$ with values of $n$ and
$\rho_1\rho_2$. However, the restrictions,

$M > 2$, for $\Gamma(\tfrac{1}{2}M - 1)$ to have a positive argument,

and

the requirement that the exponent of $(1 - r^2)$ be positive,

impose a lower bound on the possible values of $\rho_1\rho_2$ for
small sample sizes. For $n = 6$,

$$M = 6 + \frac{\rho_1\rho_2(6 - 5\rho_1\rho_2)}{1 - (\rho_1\rho_2)^2} > 2$$

requires that

$$\rho_1\rho_2 > -.42. \tag{A.8}$$

Hence, for small sample sizes, choice of the parameters $\rho_1$,
$\rho_2$ must be subject to the condition (A.8).
In $K$ (A.2), both the BETA function and the factor $2^{M-3}$
produce overflows for large values of $M$. To compute the
BETA function, Eqn. (A.4) is used. For sufficiently small $M$, no
overflow condition will result in the computation of the GAMMA
function, since only for arguments greater than 57 do values of
the GAMMA function exceed $10^{75}$. In order to handle large
values of $M$, the following Triplication Formula [14] is used:

$$\Gamma(3z) = (2\pi)^{-1}\, 3^{3z - 1/2}\, \Gamma(z)\,\Gamma(z + \tfrac{1}{3})\,\Gamma(z + \tfrac{2}{3}). \tag{A.9}$$

This formula can handle values of $M$ as large as 346. For
larger values of $M$, the following Gauss Multiplication
Formula [14] may be used:

$$\Gamma(kz) = (2\pi)^{(1-k)/2}\, k^{kz - 1/2} \prod_{i=0}^{k-1} \Gamma(z + i/k). \tag{A.10}$$
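The triplication formula (A.9) can be checked numerically, and on current systems the BETA function of Eqn. (A.4) can instead be evaluated through logarithms of GAMMA values. The sketch below shows both. The log-gamma route is a substitute technique and not the procedure of the thesis, and the function names and the value of $M$ are illustrative.

```python
import math

def log_beta(a, b):
    """log B(a, b) from log-gamma values, which avoids forming large GAMMA values."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)

def gamma_by_triplication(z):
    """Gamma(3z) assembled from Gamma(z), Gamma(z + 1/3) and Gamma(z + 2/3)."""
    return (3.0 ** (3.0 * z - 0.5) / (2.0 * math.pi)
            * math.gamma(z)
            * math.gamma(z + 1.0 / 3.0)
            * math.gamma(z + 2.0 / 3.0))

z = 2.5
print(math.gamma(3 * z), gamma_by_triplication(z))

M = 400.0  # an illustrative large value of M
print(log_beta(0.5 * M - 1.0, 0.5))
```

The two values printed on the first line agree to rounding error, and the logarithm of the BETA function stays finite for values of $M$ at which the GAMMA values themselves would be very large.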


For $M \gg 0$, the factor $2^{M-3}$ in $K$ will produce
overflows. To avoid this, the $k$th root (such as $k = 3$) of $K$ is
first computed by


Kemet 2g; emi hee Noe ol) a) 


and the resultant $K^{1/k}$ value is then raised to the power $k$ at
an appropriate stage of the calculation.
In $D$ (A.3), the term $(1 - r^2)^{(M-4)/2}$ may produce
underflows since it approaches zero rapidly as $r$ tends to one.
To prevent underflows, the $d$th root (such as $d = 50$) of $D$ is
computed by

$$D^{1/d} = K^{1/d}(1 - r^2)^{(M-4)/(2d)}. \tag{A.12}$$

The resultant $D^{1/d}$ value is then checked against a specified
small value $\varepsilon$. If $D^{1/d} < \varepsilon$, $D$ is set equal
to zero, and the value of the integral in Eqn. (A.1) is not
changed. For example, for $d = 50$, $\varepsilon$ is taken to be 0.5
since $(0.5)^{50} \approx 10^{-15}$.
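The root-and-threshold device of Eqn. (A.12) can be sketched as follows. The constant $K$ is passed in as a plain number (defaulting to 1.0 purely for illustration), and the function name is hypothetical.

```python
def guarded_power_term(r, M, K=1.0, d=50, eps=0.5):
    """Return D = K * (1 - r**2) ** ((M - 4) / 2), or 0.0 when the
    d-th root of D falls below eps."""
    root = K ** (1.0 / d) * (1.0 - r * r) ** ((M - 4.0) / (2.0 * d))
    if root < eps:
        return 0.0      # D would be below eps**d, about 1e-15 for d = 50, eps = 0.5
    return root ** d

print(0.5 ** 50)                        # threshold on D itself
print(guarded_power_term(0.3, 30.0))    # root exceeds eps, so D is kept
print(guarded_power_term(0.99, 300.0))  # root falls below eps, so D is set to zero
```

Working with the $d$th root keeps every intermediate value well inside the floating-point range, so the comparison with $\varepsilon$ can be made before any underflow occurs.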


Taking into account the above considerations, an algorithm
and Fortran program may now be developed to evaluate
$p^*(r; n, \rho_1\rho_2)$ for a given set of parameters
$(n, \rho_1\rho_2)$. The following steps comprise a general procedure
for evaluating $p^*(r; n, \rho_1\rho_2)$ at a given $r$-value:

1) Compute the value of $M$ for the given set of parameters
$(n, \rho_1\rho_2)$ using Eqn. (A.6).

2) If $M \le 2$, $\rho_1\rho_2$ is incremented by $\delta$ (that is,
$\rho_1\rho_2 = \rho_1\rho_2 + \delta$, where $\delta$ is a small
increment such as .01) until $M$ becomes greater than 2. This step
ensures the proper choice of $\rho_1\rho_2$ for the particular small
sample size $n$ used.

3) Compute $B[\tfrac{1}{2}M - 1, \tfrac{1}{2}]$ using Eqn. (A.4) and
formula (A.9) or (A.10) for the GAMMA functions (depending on the
magnitude of $M$).

4) Compute $K^{1/k}$ by Eqn. (A.11) (using, for example, $k = 3$).

5) Raise the value $K^{1/k}$ to the power $k$.

6) Compute $D^{1/d}$ by Eqn. (A.12) (using, for example, $d = 50$).

7) If $D^{1/d} < \varepsilon$ (using $\varepsilon = .5$ for $d = 50$),
set $D$ equal to zero.

8) If $D^{1/d} > \varepsilon$, raise its value to the power $d$.

9) Compute $C$ by Eqn. (A.5).

10) Evaluate $p^*(r; n, \rho_1\rho_2)$ using Eqn. (A.7)
($p^*(-r; n, \rho_1\rho_2) = p^*(r; n, \rho_1\rho_2)$).

To facilitate the handling of very large numbers and at the
same time achieve greater accuracy in the computation of
$p^*(r; n, \rho_1\rho_2)$, double precision arithmetic is used to
program the above algorithm.
A2 Procedure for Evaluation of Critical Values of $r_{xy}$


Using the method developed in Section A1 for evaluating
$p^*(r; n, \rho_1\rho_2)$ and the Gauss-Legendre quadrature formula
for numerical integration, an algorithm is devised to compute the
critical points of $r_{xy}$ for appropriate sets of the parameters
$(n, \rho_1\rho_2)$. The method of interval bisection is used to
estimate the critical points to a desired accuracy.

Let $I_\alpha$ denote the area under the $p^*(r; n, \rho_1\rho_2)$
distribution curve as indicated in Fig. A2. Then $I_\alpha$ is
given by

$$I_\alpha = \int_0^{r_{\alpha;n,\rho_1\rho_2}} p^*(r; n, \rho_1\rho_2)\,dr = 0.5 - \alpha. \tag{A.13}$$

For any point $r_i$ ($0 \le r_i \le 1$), let

$$I_i = \int_0^{r_i} p^*(r; n, \rho_1\rho_2)\,dr. \tag{A.14}$$

The algorithm for computing $r_{\alpha;n,\rho_1\rho_2}$ proceeds as
follows, starting from the interval with lower end $r_a = 0.0$
and upper end $r_b = 1.0$:

1) Compute $r_i = (r_a + r_b)/2$.

2) Compute the area $I_i$ by Eqn. (A.14) using the Gauss-Legendre
quadrature formula for integration.

3) If $|I_i - I_\alpha| \le \varepsilon$ for a prescribed
$\varepsilon$, $r_i$ is taken to be the value of
$r_{\alpha;n,\rho_1\rho_2}$. [It is possible to arrive at this
inequality since the Gauss-Legendre quadrature formula used to
compute $I_i$ and $I_\alpha$ is known to converge for the
$p^*(r; n, \rho_1\rho_2)$ function (see Shroud [35] and Krylov
[24]).] Otherwise, go to step (4).

4) If $I_i < I_\alpha$, set $r_a = r_i$ and go to step (1) to
compute the next approximation. If $I_i > I_\alpha$, set
$r_b = r_i$ and go to step (1) to compute the next approximation.

The critical points computed by the above algorithm will lie
within ±$\varepsilon$ of the exact critical values for the
approximate distribution.
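The bisection procedure can be sketched as below. Because the full expression for $p^*(r; n, \rho_1\rho_2)$ is not reproduced here, the sketch uses the null density of the Pearson correlation coefficient (the case $\rho_1\rho_2 = 0$) as a stand-in integrand. The 32-point Gauss-Legendre nodes are generated by Newton iteration rather than taken from tables, and all function names are illustrative.

```python
import math

def gauss_legendre(m):
    """Nodes and weights of the m-point Gauss-Legendre rule on [-1, 1]."""
    nodes, weights = [], []
    for i in range(1, m + 1):
        x = math.cos(math.pi * (i - 0.25) / (m + 0.5))  # initial guess for the i-th root
        for _ in range(100):
            p_prev, p = 1.0, x
            for k in range(2, m + 1):                   # Legendre recurrence up to P_m(x)
                p_prev, p = p, ((2 * k - 1) * x * p - (k - 1) * p_prev) / k
            dp = m * (x * p - p_prev) / (x * x - 1.0)   # derivative P_m'(x)
            dx = p / dp
            x -= dx
            if abs(dx) < 1e-15:
                break
        nodes.append(x)
        weights.append(2.0 / ((1.0 - x * x) * dp * dp))
    return nodes, weights

NODES, WEIGHTS = gauss_legendre(32)

def integrate(f, a, b):
    """Integral of f over [a, b] by the 32-point rule."""
    half, mid = 0.5 * (b - a), 0.5 * (a + b)
    return half * sum(w * f(mid + half * x) for x, w in zip(NODES, WEIGHTS))

def pearson_density(r, n):
    """Null density of the Pearson correlation coefficient (rho1 * rho2 = 0)."""
    log_b = math.lgamma(0.5) + math.lgamma(0.5 * (n - 2)) - math.lgamma(0.5 * (n - 1))
    return math.exp(0.5 * (n - 4) * math.log(1.0 - r * r) - log_b)

def critical_point(density, alpha, tol=1e-6, max_iter=200):
    """Bisection on r so that the area from 0 to r equals 0.5 - alpha."""
    target = 0.5 - alpha
    r_a, r_b = 0.0, 1.0
    r_i = 0.5
    for _ in range(max_iter):
        r_i = 0.5 * (r_a + r_b)
        area = integrate(density, 0.0, r_i)
        if abs(area - target) <= tol:
            break
        if area < target:
            r_a = r_i
        else:
            r_b = r_i
    return r_i

print(critical_point(lambda r: pearson_density(r, 33), 0.025))
print(critical_point(lambda r: pearson_density(r, 30), 0.025))
```

For $n = 33$ and $n = 30$ the sketch returns values of about .344 and .361, the zero-product critical points quoted in the worked example. Replacing `pearson_density` by an implementation of $p^*(r; n, \rho_1\rho_2)$ would give the critical points of the approximate distribution.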
APPENDIX B


JUSTIFICATION OF THE ASSUMPTION THAT THE ERROR IN THE SIMULATED
DISTRIBUTION FOR $\rho_1\rho_2 \neq 0$ IS THE SAME AS THE ERROR
OBTAINED FOR THE CASE WHERE $\rho_1\rho_2 = 0$.


We note from the plots of the approximate distribution
$p^*(r)$ shown in the figures that the distributions for the
various values of $\rho_1\rho_2$ are similar in form to the normal
$N(0, \sigma^2)$ distribution with zero mean and variance
$\sigma^2$. For the case $\rho_1\rho_2 = 0$, $p(r)$ is approximately
normal for large $n$. Hence, let us use the normal distributions
to validate our assumption concerning the error estimate made in
Section 5.2.

Let us consider the $N(0,1)$ distribution as analogous to
the $p(r)$ distribution for $\rho_1\rho_2 = 0$. Similarly, let the
$N(0, \sigma_1^2)$ and $N(0, \sigma_2^2)$ distributions, where
$\sigma_1 > 1$ and $\sigma_2 < 1$, be considered analogous to the
$p(r)$ distributions for $\rho_1\rho_2 > 0$ and $\rho_1\rho_2 < 0$,
respectively. Making an arbitrary choice of the values of
$\sigma_1$ and $\sigma_2$, we simulated the $N(0,1)$, $N(0,.5)$
and $N(0,1.5)$ distributions, using Chen's random number
generator and a sample size of 7,000 in each case. We repeated
this set of simulations twice, using in each case a different
pair of seeds to generate the sample of random numbers. For each
of the distributions simulated we computed the corresponding
Kolmogorov-Smirnov statistic $D(N)$. If our assumption is valid,
the values of $D(N)$ for these simulated distributions should be
less than the error bound adopted for the case
$\rho_1\rho_2 = 0$. The results of the simulations are summarized
below.


Kolmogorov-Smirnov Statistics for Simulated Normal
Distributions.
(Seeds 748,511,649 and 147,303,541; $N = 7000$; statistic $D(N)$.)


The values of $D(N)$ for all nine distributions are less than
.0172. Hence, we can conclude with 99% certainty that the error
in all nine simulated distributions is less than ±.02. This
serves to justify our error estimate for the simulated
distribution with $\rho_1\rho_2 \neq 0$.
APPENDIX C


COMPUTER CONSIDERATIONS IN SIMULATION.


Simulation programs are usually written in a high level
language. Fortran IV is used in this simulation study since it
is a readily available general-purpose, problem-oriented
language most suited to the nature of the problems encountered
here. The normal random number generator suggested by Chen is
written in Fortran and was found to perform reasonably well on
the IBM 360/67.


The major costs of computation involved in this study
result from the following:

1) Generation of a large number of normal random numbers in
each simulation of the cross-correlation distribution. Each
simulation with 7,000 sample values requires over 200,000
random numbers. Approximately 3 minutes are required for
each simulation run on the IBM 360/67, using Chen's random
number generator.

2) Testing the goodness-of-fit of the simulated distribution
by the Kolmogorov-Smirnov test. This test involves, at each
generation of the $r$ value, the non-decreasing ordering of
the $r$ sequence to compute the empirical cdf, and the
numerical integration of the function $p^*(r; n, \rho_1\rho_2)$ to
obtain the theoretical cdf. The numerical integration in
the test is done by means of the 32-point Gauss-Legendre
formula, which integrates polynomials of degree up to 63
exactly. A complete test on the simulated distribution by the
Kolmogorov-Smirnov criterion, using a sample size of 7000,
required about 10 minutes of computer time. It is possible
to reduce the amount of computation in the integration by
employing a quadrature formula of low order (that is, with
fewer base-points). However, this would require
knowledge of the degree of the $p^*(r; n, \rho_1\rho_2)$ function to
be integrated before the formula can be applied
appropriately. Using the 32-point formula this requirement
can be avoided since the formula can handle functions of
high degrees. Furthermore, for the range of values of the
parameters $n$, $\rho_1\rho_2$ considered, the degree of the
$p^*(r; n, \rho_1\rho_2)$ function is not likely to exceed 63. To
improve the accuracy in computation, double precision
arithmetic is used to program the evaluation and
integration of $p^*(r; n, \rho_1\rho_2)$.


Due to the high cost involved in simulating and testing a
distribution with a large sample, the error analysis for the
approximate distribution and its critical values could be
performed only for a limited number of values of the parameters
$(n, \rho_1\rho_2)$. Table C below shows the computer timing obtained
for some runs of the simulation and testing programs.


Table C
Computer Time Statistics.
(CPU Time Used)


Type of Run                          Approx. Time Used (secs.)


Generation of 40,000 normal
random numbers.


Generation and testing
(by Kolmogorov-Smirnov test)
of 7,000 normal random numbers.


Simulation of cross-correlation
distribution for
PRpoe Noite, he 30, N — 7000 


Simulation and testing 
(by Kolmogorov-Smirnov test) 
of distribution for 

ay 2G eis Os en 7000 


-10, n 30, N 7000 
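A simulation run of the kind timed in Table C may be sketched as follows: two independent stationary first-order Markov series of length n are generated, their sample cross-correlation r is computed, and the procedure is repeated N times. The sketch rests on several assumptions that are not taken from the thesis programs: Python's built-in normal generator stands in for Chen's generator, the means are fitted (subtracted) before correlating, and the parameter values, seed and function names are hypothetical.

```python
import math
import random


def markov_series(n, rho, rng):
    """Stationary first-order Markov series with lag-one autocorrelation rho."""
    # Start from the stationary distribution so that no warm-up is needed.
    x = rng.gauss(0.0, 1.0) / math.sqrt(1.0 - rho * rho)
    series = [x]
    for _ in range(n - 1):
        x = rho * x + rng.gauss(0.0, 1.0)
        series.append(x)
    return series


def sample_cross_correlation(x, y):
    """Lag-zero sample cross-correlation with fitted means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate(n, rho1, rho2, runs, seed=0):
    """Return `runs` simulated r values in non-decreasing order."""
    rng = random.Random(seed)
    return sorted(
        sample_cross_correlation(
            markov_series(n, rho1, rng), markov_series(n, rho2, rng)
        )
        for _ in range(runs)
    )


# Hypothetical parameter values, chosen only for illustration.
r_values = simulate(n=30, rho1=0.5, rho2=0.2, runs=2000)
print("smallest r:", r_values[0], "largest r:", r_values[-1])
```

The sorted list returned by `simulate` is the ordered r sequence from which an empirical cdf can be read off directly.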




APPENDIX D
Table D1
Critical Values $D_\alpha(N)$ for the Kolmogorov-Smirnov Test


such that


$P\{ \max_r | F_N(r; n, \rho_1\rho_2) - F_0(r; n, \rho_1\rho_2) | > D_\alpha(N) \} = \alpha$


                  Significance Level $\alpha$


    N            .01            .05            .10


    5           0.669          0.565          0.510
    6           0.618          0.521          0.470
    7           0.577          0.486          0.438
    8           0.543          0.457          0.411
    9           0.514          0.432          0.388
   10           0.490          0.410          0.368
   11           0.468          0.391          0.352
   12           0.450          0.375          0.338
   13           0.433          0.361          0.325
   14           0.418          0.349          0.314
   15           0.404          0.338          0.304
   16           0.392          0.328          0.295
   17           0.381          0.318          0.286
   18           0.371          0.309          0.278
   19           0.363          0.301          0.272
   20           0.356          0.294          0.264
   25           0.320          0.270          0.240
   30           0.290          0.240          0.220
   35           0.270          0.230          0.210


 Over 35     $1.63 N^{-1/2}$    $1.36 N^{-1/2}$    $1.22 N^{-1/2}$


Adapted from Massey, F.J., Jr., [29]
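For sample sizes above 35 the critical value follows from the last row of Table D1 by simple arithmetic. The sketch below applies that row to a test sample of size N = 7000, the size used for the simulated distributions; the function names and the observed statistic `d_observed` are hypothetical and serve only to show the decision rule.

```python
import math

# Large-sample coefficients from the last row of Table D1 (valid for N > 35).
COEFFICIENTS = {0.01: 1.63, 0.05: 1.36, 0.10: 1.22}


def critical_value(n_sample, alpha):
    """Large-sample critical value D_alpha(N) = c_alpha / sqrt(N)."""
    if n_sample <= 35:
        raise ValueError("for N <= 35 read the value from Table D1 directly")
    return COEFFICIENTS[alpha] / math.sqrt(n_sample)


def reject_fit(d_observed, n_sample, alpha=0.05):
    """True if the observed Kolmogorov-Smirnov statistic exceeds D_alpha(N)."""
    return d_observed > critical_value(n_sample, alpha)


d_observed = 0.012  # hypothetical value of the test statistic
for alpha in (0.01, 0.05, 0.10):
    print(alpha, critical_value(7000, alpha), reject_fit(d_observed, 7000, alpha))
```

The critical value shrinks in proportion to $N^{-1/2}$, so a sample of 7000 detects much smaller departures from the theoretical cdf than the tabulated sizes of 35 or less.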




Table D2
Critical Values of Cross-Correlation Coefficients $r_{xy}$
(table printed sideways in the original; entries not recoverable)




BIBLIOGRAPHY


[1]  Anderson, T.W. and Darling, D.A., (1954), 'A Test of
     Goodness-of-Fit.'
     J.A.S.A., Vol. 49, 765 - 769.

[2]  Bahadur, R.R., (1966), 'A Note on Quantiles in Large
     Samples.'
     Annals of Math. Stat., Vol. 37, 577 - 580.

[3]  Bielenstein, U.M., (1963), 'The Approximate Distribution
     of the Correlation between Two Stationary Linear
     Markov Series with Fitted Means.'
     M.Sc. Thesis (unpublished), University of Alberta.

[4]  Birnbaum, Z.W., (1952), 'Numerical Tabulation of the
     Distribution of Kolmogorov's Statistic for Finite
     Sample Size.'
     J.A.S.A., Vol. 47, 425 - 441.

[5]  Box, G.E.P. and Jenkins, G.M., (1970), 'Time Series
     Analysis, Forecasting and Control.'
     Holden-Day.

[6]  Box, G.E.P. and Muller, M.E., (1958), 'A Note on the
     Generation of Random Normal Deviates.'
     Annals of Math. Stat., Vol. 29, 610 - 611.

[7]  Chen, E.H., (1971), 'A Random Normal Number Generator for
     32-Bit-Word Computers.'
     J.A.S.A., Vol. 66, 400 - 403.

[8]  Coveyou, R.R. and MacPherson, R.D., (1967), 'Fourier
     Analysis of Uniform Random Number Generators.'
     Journal of ACM, Vol. 14, No. 1, 100 - 119.

[9]  Downham, D.Y. and Roberts, F.D.K., (1967), 'Multiplicative
     Congruential Pseudo-Random Number Generators.'
     Computer Journal, 10, No. 1, 74 - 77.

[10] Feller, W., (1948), 'On the Kolmogorov-Smirnov Limit
     Theorems for Empirical Distributions.'
     Annals of Math. Stat., Vol. 19, 177 - 189.

[11] Gaver, D.P., Jr., (1972), 'Statistical Methods for Improving
     Simulation Efficiency.'

[12] Gorenstein, S., (1966), 'Another Pseudo-Random Number
     Generator.'
     Comm. ACM, 9, No. 10, 711.

[13] Hammersley, J.M. and Handscomb, D.C., (1964), 'Monte Carlo
     Methods.'
     Wiley and Sons, N.Y.

[14] Handbook of Mathematical Functions, (1966), National
     Bureau of Standards, U.S. Department of Commerce.

[15] Hutchinson, D.W., (1966), 'A New Uniform Pseudo-Random
     Number Generator.'
     Comm. ACM, 9, No. 6, 432 - 433.

[16] Janssen, C. ... , '... Management of Bank Reserves.'
     (Unpublished), University of Alberta.

[17] Jansson, B., (1966), 'Random Number Generators.'
     Victor Petterson, Stockholm.

[18] Jenkins, G.M. and Watts, D.G., (1968), 'Spectral Analysis
     and its Applications.'
     Holden-Day.

[19] Kahn, H. and Marshall, A.W., (1953), 'Methods of Reducing
     Sample Size in Monte Carlo Computations.'
     Operations Research, 1, 5

[20] Kahn, H., (1956), 'Use of Different Monte Carlo Sampling
     Techniques.'

[21] Keeping, E.S., (1962), 'Introduction to Statistical
     Inference.'
     Van Nostrand.

[22] Kolmogorov, A.N., (1933), 'Sulla Determinazione Empirica
     di una Legge di Distribuzione.'
     Giorn. Dell'Istit. Degli Att., Vol. 4, 461 - 463.

[23] Kronmal, R., (1964), 'Evaluation of a Pseudo-Random Normal
     Number Generator.'
     Journal of ACM, 11, 357 - 363.

[24] Krylov, V.I., (1962), 'Approximate Calculation of
     Integrals.'
     Macmillan, N.Y./London.

[25] Lewis, P.A.W., 'The Anderson-Darling Statistic.'

[26] Lewis, P.A.W., Goodman, A.S. and Miller, J.M., (1969), 'A
     Pseudo-Random Number Generator for the System/360.'
     IBM Systems Journal, 8, No. 2, 136 - 146.

[27] Marsaglia, G. and Bray, T.A., (1968), 'On-Line Random
     Number Generators and Their Use in Combinations.'
     Comm. ACM, 11, 757 - 759.

[28] Marsaglia, G. and MacLaren, N.D., (1965), 'Uniform Random
     Number Generator.'
     Journal of ACM, 12, 83 - 89.

[29] Massey, F.J., Jr., (1951), 'The Kolmogorov-Smirnov Test for
     Goodness-of-Fit.'
     J.A.S.A., Vol. 46, 68 - 78.

[30] McGregor, J.R. and Bielenstein, U.M., (1965), 'The
     Approximate Distribution of the Correlation between
     Two Stationary Linear Markov Series II.'
     Biometrika, Vol. 52, No. 1-2, 301 - 302.

[31] Miller, L.H., (1956), 'Table of Percentage Points of
     Kolmogorov Statistics.'
     J.A.S.A., Vol. 51, 111 - 121.

[32] Oak Ridge National Laboratory, (1968), 'Pseudo-Random
     Number Generator.'
     University of Alberta Program Library.

[33] Seraphin, D.S., (1969), 'A Fast Random Number Generator
     for IBM 360.'
     Comm. ACM, 12, 695.

[34] Shreider, Y.A., (1966), 'The Monte Carlo Method.'
     Pergamon Press.

[35] Stroud, A.H. and Secrest, D., (1966), 'Gaussian Quadrature
     Formulas.'
     Prentice-Hall.

[36] Smirnov, N.V., (1944), 'Approximate Laws of Distribution
     of Random Variables from Empirical Data.'
     Uspehi Matem. Nauk, Vol. 10, 179 - 206.

[37] Smirnov, N.V., (1948), 'Table for Estimating the Goodness-
     of-Fit of Empirical Distributions.'
     Annals of Math. Stat., Vol. 19, 279 - 281.

[38] Tocher, K.D., (1963), 'The Art of Simulation.'
     The English Universities Press, Ltd., London.

[39] Wolfowitz, J. and Levene, H., (1944), 'The Covariance
     Matrix of Runs Up and Down.'
     Annals of Math. Stat., Vol. 15, 58 - 69.

